PHARMACEUTICAL COMPOSITION, CONTAINING FRAGMENTS OF AN ANTIGENIC
PROTEIN ENCODING DNA ENDOWED WITH ANTI-TUMOR EFFECT.

Field of the invention 
The invention relates to a pool of DNA plasmid constructs
containing the sequences of human MUC-1 encoding fragments and
to a pool of DNA plasmids in which the fragments themselves are
preceded by the sequence encoding a protein consisting of human
ubiquitin fused to a bacterial LacI fragment. The invention
further relates to their use in the preparation of
pharmaceutical compositions for use as DNA anti-tumor vaccines.

Background art

The invention provides an anti-tumor therapy based on the
induction or activation of an immune response able to bring
about tumor rejection. The validity of such an idea is
demonstrated by the first clinical results; for example,
patients treated with a viral vaccine containing the
Carcinoembryonic Antigen (CEA) encoding sequences demonstrated
immune system activation against this antigen (Tsang KY et al.,
J. Natl. Cancer Inst. 87: 982, 1995).

The activation of an anti-tumor immune response is
achievable through four different approaches:

a) Ex vivo engineering of patient tumor cells in order to
make them more immunogenic and suitable as a vaccine;

b) Ex vivo engineering of patient immune cells in order to
pre-activate an in vitro immune response;

c) Inoculation of DNA encoding tumor associated antigens,
either naked, liposome-encapsulated or integrated into viral
particles (retrovirus, vaccinia virus, adenovirus, etc.);

d) Treatment with recombinant or synthetic soluble tumor
antigens conjugated or mixed with adjuvants.

The first two approaches consist of the engineering of
each individual patient's cells and are limited in that they are
necessarily patient-specific, while the latter two are aimed at
obtaining products comparable to a traditional drug.

The new vaccination methods reflect the development of new
technologies. Recent results from experiments with naked DNA
vaccines, which induce either a persistent antibody response or
a cellular immune response, make obsolete the traditional
protein subunit vaccines, which consist of certain specific
peptides inducing a lymphocyte population. Proteins encoded by
intramuscularly or intradermally injected naked DNA induce a
specific cytotoxic response as well as a helper response. This
powerful combination is extremely effective, but the underlying
mechanism is not yet completely clarified. Muscle cells express
class I MHC antigens at low levels only, and apparently do not
express class II antigens or co-stimulatory molecules.
Consequently, transfected muscle cells are unlikely to play an
important role in the onset of the immune response per se.
Recent data show that Antigen Presenting Cells (APC), such as
macrophages or dendritic cells, play a fundamental role in
capturing the antigen released by myocytes and in the subsequent
processing and presentation of the respective peptides in the
context of class I and II molecules, thus inducing activation of
CD8+ cells with cytotoxic activity as well as activation of the
CD4+ cells co-operating with B lymphocytes in eliciting the
antibody response (Corr M et al., J. Exp. Med. 184:2555, 1996)
(Tighe H. et al., Immunology Today 19:89, 1998).

Furthermore, the use of cytokines is known to improve the
therapeutic effect deriving from immunization with DNA.
Cytokines can be administered in the form of exogenous proteins,
as reported in Irvine et al., J. Immunol. 156: 238, 1996. An
alternative approach is the simultaneous inoculation of both the
tumor antigen encoding plasmid and the desired cytokine encoding
plasmid, thus allowing the cytokine to be produced in situ
(Kim JJ et al., J. Immunol. 158: 816, 1997).

The active immunization approach of the present invention
is based on the use of DNA vectors as vaccines against the MUC-1
human antigen, or Polymorphic Epithelial Mucin (PEM),
overexpressed in tumor cells. MUC-1 is an epithelial luminal
surface glycoprotein (Patton S. et al., BBA 1241:407, 1995). In
the cell transformation process this glycoprotein loses its
apical localization and its expression level rises dramatically.
The function of the protein is to protect the luminal
surfaces, for example in the mammary gland, ovary, endometrium,
colon, stomach, pancreas, bladder, kidney, etc. A glycosylation
defect is reported that makes tumor cell associated MUC-1
antigenically different from normal cell associated MUC-1. This
phenomenon causes tumor MUC-1 to expose the antigen epitopes
that are normally masked by the sugar moieties in the MUC-1
expressed by normal cells. This characteristic makes tumor MUC-1
particularly interesting for the induction of a tumor specific
antibody response (Apostolopoulos V. et al., Crit. Rev. Immunol.
14:293, 1994).

As an objective, the vaccination is aimed at inducing
immune responses against tumor cells expressing MUC-1 at high
levels, while sparing the low-expressing normal epithelia. DNA
vaccination relies upon the entry of a gene, or portions
thereof, into the body cells, followed by transcription and
translation of the inserted sequence and thus the intracellular
synthesis of the corresponding polypeptide. An important
advantage of this system is that the neo-synthesized protein is
naturally processed inside the cell and the resulting peptides
are associated with the Major Histocompatibility Complex class I
molecules (MHC-I). The MHC/peptide complexes are therefore
naturally exported to the cell surface, where they can be
recognized by the CD8+ cytotoxic cells of the immune system.
Only the polypeptides synthesized inside the cell are processed
and presented in association with the MHC class I molecules,
making this the only mechanism able to stimulate a specific
cytotoxic response. Vaccination systems based on protein or
peptide administration are usually more effective in stimulating
the antibody immune response which, to date, has been shown to
be ineffective in rejecting tumor cells. Current gene therapy
techniques rely upon DNA packaging in recombinant viral vectors
(retrovirus and adenovirus). Naked DNA administration is much
more advantageous in terms of effectiveness and safety compared
to viral vector therapies (Kumar V and Sercarz E., Nature Med.
2: 857, 1996; McDonnell W.M. et al., New England J. of Med.
334: 42, 1996). In fact, naked DNA is unable either to duplicate
or to integrate into the host tissue DNA and does not induce an
immune response to viral proteins.

The use of ubiquitin to enhance the processing of the
neo-synthesized protein, and thus the induction of cytotoxic
lymphocytes, was recently reported (Rodriguez F. et al.,
J. Virology 71: 8497, 1997). The use of ubiquitin in order to
generate proteins with an N-terminal amino acid that makes them
unstable, and thus prone to enhanced degradation, had been
previously reported (Bachmair A. et al., Science 234: 179,
1986). The higher instability of these proteins was subsequently
related to enhanced intracellular processing and presentation of
model proteins by MHC-I (Grant E.P. et al., J. Immunol. 155:
3750, 1995) (Wu Y and Kipps T.J., J. Immunol. 159: 6037, 1997).

The use in DNA vaccination of single constructs containing
partial antigen encoding DNA fragments (influenza virus
nucleoprotein), having a higher antigenic presentation
efficiency compared to the analogues with the whole antigenic
sequence, was reported (Anton L.C. et al., J. Immunol. 158:
2535, 1997). Furthermore, the processing of intracellular
proteins and the presentation of the respective peptides by MHC
class I proteins under physiologic conditions underlie the
mechanism of immunological surveillance. For a given protein and
a specific MHC context, there are peptide fragments termed
dominant (i.e. prevailing over subdominant or cryptic ones),
which are unable to generate any immune response because they
are recognized as "self". It has now been found, according to an
aspect of the present invention, that an approach aimed at
supporting the presentation of non-dominant epitopes by the
administration of a mix of antigen protein fragments is able to
elicit a surprising cytotoxic immune response.
Description of the invention

It has now been found that DNA molecules encoding
fragments of a protein overexpressed in tumor cells can be
conveniently used to induce an antigen-specific anti-tumor
immune response.

The invention relates particularly to a pharmaceutical
composition containing one or more DNAs encoding Mucin (MUC-1)
protein fragments.

The DNA used in the present invention can be plasmid or
viral DNA, preferably plasmid DNA obtained employing the pMRS30
expression vector described in fig. 13.

The compositions according to the invention preferably
contain at least two DNA fragments of the Mucin (MUC-1) or of
another protein overexpressed in tumor cells.

The compositions according to the invention preferably
contain at least four fragments, each ranging from 200 to
about 700 nucleotides, each sequence being juxtaposed and
possibly partially overlapping, by about 50 to about 150
nucleotides, at the 3' and/or 5' end of the adjacent one.
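
As an illustration of the juxtaposed, partially overlapping layout described above, the following minimal sketch checks the four coordinate ranges that the figure descriptions give on EMBL sequence J05581 (136-339, 205-720, 631-1275 and 1222-1497) against the stated size and overlap ranges. The function and variable names are hypothetical and are not part of the original disclosure.

```python
# Fragment coordinates (1-based, inclusive) on EMBL J05581, as listed for Figs. 1-4.
FRAGMENTS = [(136, 339), (205, 720), (631, 1275), (1222, 1497)]


def length(frag):
    """Length in nucleotides of an inclusive coordinate range."""
    start, end = frag
    return end - start + 1


def overlap(left, right):
    """Number of nucleotides shared by two ranges (0 if they do not overlap)."""
    return max(0, min(left[1], right[1]) - max(left[0], right[0]) + 1)


lengths = [length(f) for f in FRAGMENTS]
overlaps = [overlap(a, b) for a, b in zip(FRAGMENTS, FRAGMENTS[1:])]

print(lengths)   # [204, 516, 645, 276]
print(overlaps)  # [135, 90, 54]
```

Under these coordinates every fragment falls within the 200 to about 700 nucleotide range and every adjacent pair overlaps by between about 50 and about 150 nucleotides.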

The DNA fragments according to the invention can
optionally be preceded at the 5' end by a ubiquitin encoding DNA
sequence and optionally also by a LacI portion of Escherichia
coli.

The invention also relates to new DNA fragments and to the
use of the Mucin-1 fragments defined above in medicine and in
the preparation of anti-tumor vaccines.

Description of the figures

Fig. 1

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS166 expression vector. This DNA includes
the sequence corresponding to nucleotides 136-339 of the EMBL
sequence J05581, preceded by the translation start codon, ATG,
and followed by two translation stop codons, TGA and TAA. The
encoded polypeptide thus includes a Methionine followed by the
amino acids encoded by the 136-339 fragment of the EMBL sequence
J05581.

Fig. 2

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS169 expression vector. This DNA includes
the sequence corresponding to nucleotides 205-720 of the EMBL
sequence J05581, preceded by the translation start codon, ATG,
and followed by two translation stop codons, TGA and TAA. The
encoded polypeptide thus includes a Methionine followed by the
amino acids encoded by the 205-720 fragment of the EMBL sequence
J05581.

Fig. 3

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS168 expression vector. This DNA includes
the sequence corresponding to nucleotides 631-1275 of the EMBL
sequence J05581, preceded by the translation start codon, ATG,
and followed by two translation stop codons, TGA and TAA. The
encoded polypeptide thus includes a Methionine followed by the
amino acids encoded by the 631-1275 fragment of the EMBL
sequence J05581.

Fig. 4

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS167 expression vector. This DNA includes
the sequence corresponding to nucleotides 1222-1497 of the EMBL
sequence J05581, preceded by the translation start codon, ATG,
and followed by two translation stop codons, TGA and TAA. The
encoded polypeptide thus includes a Methionine followed by the
amino acids encoded by the 1222-1497 fragment of the EMBL
sequence J05581.

Fig. 5

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS175 expression vector. This DNA includes
the sequence corresponding to nucleotides 136-1497 of the EMBL
sequence J05581, preceded by the translation start codon, ATG,
and followed by two translation stop codons, TGA and TAA. The
encoded polypeptide thus includes a Methionine followed by the
amino acids encoded by the 136-1497 fragment of the EMBL
sequence J05581.

Fig. 6

Nucleotide DNA sequence (with the respective amino acid
sequence) termed UBILacI. The encoded polypeptide includes the
ubiquitin sequence fused to a partial sequence of the bacterial
protein beta-galactosidase, as described in Chau V. et al.,
Science 243: 1576, 1989.

Fig. 7

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS171 expression vector. This DNA includes
the sequence termed UBILacI (see fig. 6) fused to the sequence
corresponding to nucleotides 136-339 of the EMBL sequence J05581
followed by two translation stop codons, TGA and TAA. The encoded
polypeptide thus includes the amino acid sequence reported in
Fig. 6, fused to the sequence including the amino acids encoded
by the fragment 136-339 of the EMBL sequence J05581.

Fig. 8

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS174 expression vector. This DNA includes
the sequence termed UBILacI (see fig. 6) fused to the sequence
partially corresponding to nucleotides 205-720 of the EMBL
sequence J05581 followed by two translation stop codons, TGA and
TAA. The encoded polypeptide thus includes the amino acid
sequence reported in Fig. 6, fused to the sequence including the
amino acids encoded by the fragment 205-720 of the EMBL sequence
J05581.

Fig. 9

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS173 expression vector. This DNA includes
the sequence termed UBILacI (see fig. 6) fused to the sequence
partially corresponding to nucleotides 631-1275 of the EMBL
sequence J05581 followed by two translation stop codons, TGA and
TAA. The encoded polypeptide thus includes the amino acid
sequence reported in Fig. 6, fused to the sequence including the
amino acids encoded by the fragment 631-1275 of the EMBL
sequence J05581.

Fig. 10

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS172 expression vector. This DNA includes
the sequence termed UBILacI (see fig. 6) fused to the sequence
partially corresponding to nucleotides 1222-1497 of the EMBL
sequence J05581 followed by two translation stop codons, TGA and
TAA. The encoded polypeptide thus includes the amino acid
sequence reported in Fig. 6, fused to the sequence including the
amino acids encoded by the fragment 1222-1497 of the EMBL
sequence J05581.

Fig. 11

Nucleotide DNA sequence (with the respective amino acid
sequence) inserted at the XbaI site of the pMRS30 expression
vector to give the pMRS176 expression vector. This DNA includes
the sequence named UBILacI (see fig. 6) fused to the sequence
partially corresponding to nucleotides 136-1497 of the EMBL
sequence J05581 followed by two translation stop codons, TGA and
TAA. The encoded polypeptide thus includes the amino acid
sequence reported in Fig. 6, fused to the sequence including the
amino acids encoded by the fragment 136-1497 of the EMBL
sequence J05581.

Fig. 12

Electrophoretic analysis on 1% agarose gel in 1X TBE. mRNA
extracted from CHO, CD34+ dendritic cells and dendritic cells
from PBMC, respectively, transfected with pMRS169, and subjected
to RT-PCR reaction either with (lanes 4, 8, 12) or without
(lanes 5, 9, 13) Reverse Transcriptase. Molecular weight DNA
marker (lane 1); internal negative controls (lanes 2, 6);
internal positive controls (lanes 3, 7, 10, 11); positive
control from Promega kit (lane 14).

Fig. 13

Nucleotide sequence of the pMRS30 expression vector. The
1-2862 region corresponds to the AccI (location 504) - BamHI
(location 3369) region of the pSV2CAT vector (EMBL M77788); the
2863-3721 region includes the human cytomegalovirus promoter
(human cytomegalovirus major immediate-early gene enhancer); the
3722-4905 region includes several cloning sites, including XbaI
(location 3727), and the processing signal of the rabbit
beta-globin gene.
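
The region layout given for Fig. 13 can be written down as a small annotation table. The sketch below only encodes the coordinates stated in the figure description and checks that the three regions are contiguous and that the XbaI cloning site falls in the third one. The labels and helper names are hypothetical, and no vector sequence is assumed.

```python
# Annotated regions of pMRS30 (1-based, inclusive), as listed for Fig. 13.
REGIONS = [
    (1, 2862, "pSV2CAT AccI-BamHI region"),
    (2863, 3721, "human cytomegalovirus promoter"),
    (3722, 4905, "cloning sites and rabbit beta-globin processing signal"),
]
XBAI_SITE = 3727  # location of the XbaI cloning site stated in the text


def regions_are_contiguous(regions):
    """True if each region starts right after the previous one ends."""
    return all(prev[1] + 1 == nxt[0] for prev, nxt in zip(regions, regions[1:]))


def region_of(position, regions):
    """Return the label of the region containing a position, or None."""
    for start, end, label in regions:
        if start <= position <= end:
            return label
    return None


print(regions_are_contiguous(REGIONS))
print(region_of(XBAI_SITE, REGIONS))
```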

Detailed description of the invention

A DNA plasmid pool encoding, in eukaryotic cells, fragments
of the human MUC-1 protein antigen was prepared. The constructs
are based on the mammalian expression vector termed pMRS30,
described in figure 13 and previously claimed in Patent
Application WO95/11982, and contain partial sequences of the
MUC-1 cDNA reported in the EMBL database with accession number
J05581. The MUC-1 encoding DNA was fragmented so that each
fragment represents a discrete portion, partially overlapping
the adjacent ones. Administration of a mix of such plasmids can
cause different plasmids to transfect different APC cells at the
administration site. Such cells therefore produce and process
discrete portions of the MUC-1 protein, giving the related
peptides. Under those conditions, the subdominant and cryptic
peptides that occur can also be presented in association with
class I MHC molecules, thus generating a cytotoxic immune
response.

The present invention thus relates to the use of a group of
four constructs (Figures 1 to 4) containing MUC-1 cDNA partial
fragments, in admixture containing at least two of them, and of
a group of four constructs (Figures 7 to 10) containing MUC-1
cDNA partial fragments preceded by the DNA encoding a protein
sequence containing Ubiquitin and an Escherichia coli LacI
portion (Figure 6), used separately or in admixture containing
at least two of them.

The present invention also relates to the use of the
construct (Figure 5) containing the almost complete sequence of
the MUC-1 cDNA and of the construct (Figure 11) containing the
almost complete sequence of the MUC-1 cDNA preceded by the DNA
encoding a protein sequence containing Ubiquitin and an
Escherichia coli LacI portion.

The mixture of the four constructs containing the partial
fragments of the MUC-1 cDNA and the mixture of the four
constructs containing the partial fragments of the MUC-1 cDNA
preceded by the DNA encoding a protein sequence containing
Ubiquitin and an Escherichia coli LacI portion each represent a
preferred embodiment of the present invention.

Constructs according to the present invention can be used
in the anti-tumor therapy of patients affected by tumors
characterized by high MUC-1 expression.

Constructs described in the present invention were obtained
as follows.

In the case of the first series of constructs, the
fragments of the MUC-1 DNA were obtained by RT-PCR from the BT20
cell line or by partial chemical synthesis of DNA. Such
fragments were then cloned into the pMRS30 expression vector and
verified by sequencing.

In the case of the second series of constructs, the
fragments were obtained from the first series of constructs by
PCR re-amplification. These fragments were then fused to the DNA
encoding Ubiquitin (obtained by RT-PCR from MCF7 cell line
mRNA) and a partial lacI sequence (obtained by PCR from the
commercial vector pGEX). The DNA sequences thus obtained were
then cloned into the pMRS30 expression vector and verified by
sequencing.

For the intended therapeutic or prophylactic uses,
fragments or constructs according to the invention are suitably
formulated, using carriers and methods previously employed in
naked DNA vaccines, as described for example in The
Immunologist, 1994, 2:1; WO 90/11092; Proc. Natl. Acad. Sci.
U.S.A., 1986, 83, 9551; US 5580859; Immunology Today 19 (1998),
89-97; Proc. Natl. Acad. Sci. U.S.A. 90 (1993), 11478-11482;
Nat. Med. 3 (1997), 526-532; Vaccine 12 (1994), 1495-1498; DNA
Cell. Biol. 12 (1993), 777-783. The dosages will be determined
on the basis of clinical and pharmacological-toxicological
trials. Generally speaking, they will range between 0.005
µg/kg and 5 µg/kg of the fragment mix. The composition of the
invention can also contain a cytokine or a cytokine encoding
plasmid.

The invention will be further illustrated by means of the
following examples.

Example 1. Plasmid pMRS166 construction.

BT20 tumor cells (ATCC HTB-19) were cultured in Eagle's MEM
supplemented with 10% fetal calf serum. Ten million cells were
trypsinized and washed with PBS, and the mRNA was extracted.

An aliquot of this RNA was subjected to an RT-PCR (reverse
transcriptase-polymerase chain reaction) in the presence of the
following synthetic oligonucleotides:

V11 (5' GATCTCTAGAATGACAGGTTCTGGTCATGCAAGC 3')

The DNA fragment produced, purified and digested with the
restriction enzyme XbaI, was cloned into the pMRS30 expression
vector, containing the human cytomegalovirus promoter and the
beta-globin polyadenylation signal, as claimed in Patent
WO95/11982. The resulting pMRS166 vector contains a fragment
including the ATG codon, the sequence corresponding to
nucleotides 136-339 of the EMBL sequence J05581, and two stop
codons, TGA and TAA.

This fragment is reported in fig. 1.
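
The insert layout used throughout the examples (an ATG start codon, the MUC-1 fragment, then the two stop codons TGA and TAA) can be sketched as a simple string construction. This is a minimal illustration, not the authors' procedure: the helper names are hypothetical, the demo fragment is a placeholder rather than MUC-1 sequence, and the in-frame check is an assumption added for the sketch.

```python
START = "ATG"          # translation start codon
STOPS = "TGA" + "TAA"  # the two translation stop codons


def build_insert(fragment):
    """Return start codon + fragment + two stop codons."""
    fragment = fragment.upper()
    if len(fragment) % 3 != 0:
        # Assumption of this sketch: the fragment is a whole number of codons.
        raise ValueError("fragment length must be a multiple of 3")
    return START + fragment + STOPS


def codons(seq):
    """Split a sequence into consecutive triplets."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


# Placeholder fragment for illustration only; NOT the MUC-1 sequence.
demo = build_insert("GCTGGTTCT")
print(codons(demo))  # ['ATG', 'GCT', 'GGT', 'TCT', 'TGA', 'TAA']

# The 136-339 range spans 204 nucleotides, i.e. 68 complete codons.
n = 339 - 136 + 1
print(n // 3, n % 3)  # 68 0
```

The same arithmetic applies to the other ranges named in the examples (205-720, 631-1275, 1222-1497 and 136-1497), whose lengths are also multiples of three.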

Example 2. Plasmid pMRS169 construction.

An aliquot of the RNA obtained as reported in example 1 was
amplified by RT-PCR in the presence of the following synthetic
oligonucleotides:

V12 (5' C3lATCTCTAGaJOTK3TCCCCAGCTCTACTGAGftRGAATOC 3')

V15 (5' GGOKSTQGAGGCCGGQGCTGGCTTGT 3')

The DNA fragment produced, purified and digested with the
restriction enzymes SmaI and XbaI, was fused, via the SmaI
restriction site, to an entirely synthetic DNA fragment
including a sequence partially corresponding to
nucleotides 457-720 of the EMBL sequence J05581 and two stop
codons, TGA and TAA. The whole fragment was then cloned into the
XbaI site of the pMRS30 expression vector. The resulting pMRS169
vector contains a DNA fragment including the ATG codon, the
sequence partially corresponding to nucleotides 205-720 of
the EMBL sequence J05581, and two stop codons, TGA and TAA.

This fragment is reported in fig. 2.

Example 3. Plasmid pMRS168 construction.

An aliquot of the RNA obtained as reported in example 1 was
amplified by RT-PCR in the presence of the following synthetic
oligonucleotides:

V13 (5' GATCTCTAGAATGGGCTCAGCTTCTACTCTGGTGCACAACGGC 3')
V8 (5' GATCTCTAiSaAAGCTrATCACAAGGCAATGAG^^ 3')

The DNA fragment produced, purified and digested with the
restriction enzyme XbaI, was cloned into the pMRS30 expression
vector. The resulting pMRS168 vector contains a DNA fragment
including the ATG codon, the sequence corresponding to
nucleotides 631-1275 of the EMBL sequence J05581, and two stop
codons, TGA and TAA.

This fragment is reported in fig. 3.

Example 4. Plasmid pMRS167 construction.

An aliquot of the RNA obtained as reported in example 1 was
subjected to RT-PCR in the presence of the following
synthetic oligonucleotides:

V14 (5' GATCTCTAGAATGCTGGTGCTGGTCTGTGTTCTGGTTGCGC 3')
V10 (5' GATCTCTAGAAAGCTTATCACAAGTim::aGWVGTGGCr^ 3')

The DNA fragment produced, purified and digested with the
restriction enzyme XbaI, was cloned into the pMRS30 expression
vector. The resulting pMRS167 vector contains a DNA fragment
including the ATG codon, the sequence corresponding to
nucleotides 1222-1497 of the EMBL sequence J05581, and two stop
codons, TGA and TAA.

This fragment is reported in fig. 4.

Example 5. Plasmid pMRS175 construction.

pMRS166, 169, 168 and 167 plasmids were subjected to PCR
in the presence of the following oligonucleotide pairs:

V11 (see example 1)
V18 (5' AACCTGPJlGCTGGTTCaSTGGC 3') for pMRS166

V19 (5' GTGCCCAGCTCTACTGAGAAGAATGC 3')
V20 (5' GCTGGGAATTGAGAATGGAGTGCTCTTGC 3') for pMRS169

V21 (5' GGCTCAGCTTCTACTCTGGTGCACAACGGC 3')
V22 (5' CaaGGCAATGAGATAGAiCAATGGCC 3') for pMRS168

V23 (5' CTGGTGCTGGTCTGTGTTCTGGTTGCG 3')
V10 (see example 4) for pMRS167

The four DNA fragments obtained in the respective PCR
reactions were mixed in equimolar amounts and subjected to PCR
in the presence of the V11 and V10 oligonucleotides.

The DNA fragment produced, purified and digested with the
XbaI restriction enzyme, was cloned into the pMRS30 expression
vector. The resulting pMRS175 vector contains a DNA fragment
including the ATG codon, the sequence partially corresponding to
nucleotides 136-1497 of the EMBL sequence J05581, and two
stop codons, TGA and TAA.

This fragment is reported in fig. 5.
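
Mixing the four overlapping fragments and amplifying them with only the two outermost primers joins them through their shared ends. The sketch below is a string-level analogy of that joining step, not a PCR simulation: it merges each sequence with the next on their longest exact suffix/prefix overlap. The function names, the `min_overlap` parameter and the toy sequences are hypothetical and are not taken from the patent.

```python
from functools import reduce


def merge(left, right, min_overlap=4):
    """Join two sequences on their longest exact suffix/prefix overlap."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("no overlap of the required length")


def assemble(fragments, min_overlap=4):
    """Merge an ordered list of overlapping fragments into one sequence."""
    return reduce(lambda acc, nxt: merge(acc, nxt, min_overlap), fragments)


# Toy sequences for illustration only; NOT the MUC-1 fragments.
parts = ["ATGACAGGTTCT", "GGTTCTGGTCATGC", "TCATGCAAGCTGA"]
full = assemble(parts)
print(full)  # ATGACAGGTTCTGGTCATGCAAGCTGA
```

In the real fragments the shared ends are far longer (tens of nucleotides), so a larger `min_overlap` would be the natural choice there.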

Example 6. Plasmid pMRS171 construction.

MCF7 tumor cells (ATCC HTB-22) were cultured in Eagle's MEM
supplemented with 10% fetal calf serum. Ten million cells were
trypsinized and washed with PBS, and the mRNA was extracted.

An aliquot of this RNA was subjected to RT-PCR in the
presence of the following synthetic oligonucleotides:

UBIup (5' GATCTCTAGftATGC3W5a^TCTTCGTC3AAGACECnxmC^ 3')

UBIdown
(5' TCACCAGCGAG&CmxaACAGCCaTGCACGACTACCGTGCCTa^
ACCAGG 3')

The reaction produces a DNA fragment termed fragment 1.

DNA from pGEX11T (Pharmacia) was subjected to PCR
in the presence of the following synthetic oligonucleotides:

LacIup (5' CCTCa5TCrCAGAGGTGGGAGGCA036TAGTGGTGCATGGC^^
GTCTCGCTOGTGAAAAG 3')

LacIdown (5' GATOSGATTCTCGGGAAACCTCTOGTGCCaGCrPGC 3')

This reaction gives a DNA fragment termed fragment 2.

DNA fragments 1 and 2, obtained in the respective PCR
reactions, were mixed in equimolar amounts and subjected to PCR
in the presence of the UBIup and LacIdown oligonucleotides.

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was cloned into the pUC18
commercial plasmid. The resulting pMRS156 vector contains a DNA
fragment including the sequence encoding ubiquitin fused to
the sequence encoding a bacterial beta-galactosidase portion.
This fragment, termed UBILacI, is reported in fig. 6.

Plasmid pMRS166 DNA was subjected to PCR in the
presence of the following synthetic oligonucleotides:

V3 (5' GATCGGATCCACAGGTTCTGGTCATGCAAGC 3')

V4 (see example 1)

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was fused, by ligation into
the two BamHI sites, to the UBILacI fragment deriving from the
pMRS156 plasmid. The resulting fragment was cloned into the
pMRS30 expression vector. The resulting pMRS171 vector contains
a DNA fragment including the UBILacI sequence, the sequence
corresponding to nucleotides 136-339 of the EMBL sequence
J05581, and two stop codons, TGA and TAA. This fragment is
reported in fig. 7.

Example 7. Plasmid pMRS174 construction.

Plasmid pMRS169 DNA was subjected to PCR in the
presence of the following synthetic oligonucleotides:

V5 (5' GATCQGATCCGTGCa=W3CTCTACTGJl(SiJmA 3')

V6 (5' OATCnxnsUSaAAGCTTATCaCtnxSGGAAT^^ 3')

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was fused, by ligation into
the two BamHI sites, to the UBILacI fragment deriving from the
pMRS156 plasmid. The resulting fragment was cloned into the
pMRS30 expression vector. The resulting pMRS174 vector contains
a DNA fragment including the UBILacI sequence, the sequence
corresponding to nucleotides 205-720 of the EMBL sequence
J05581, and two stop codons, TGA and TAA. This fragment is
reported in fig. 8.

Example 8. Plasmid pMRS173 construction.

Plasmid pMRS168 DNA was subjected to PCR in the
presence of the following synthetic oligonucleotides:

V7 (5' GATCGGATCGOK^TCAaCTrcTACTCTGGTGCACAAOGGC 3')
V8 (see example 3)

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was fused, by ligation into
the two BamHI sites, to the UBILacI fragment deriving from the
pMRS156 plasmid. The resulting fragment was cloned into the
pMRS30 expression vector. The resulting pMRS173 vector contains
a DNA fragment including the UBILacI sequence, the sequence
corresponding to nucleotides 631-1275 of the EMBL sequence
J05581, and two stop codons, TGA and TAA. This fragment is
reported in fig. 9.

Example 9. Plasmid pMRS172 construction.

Plasmid pMRS167 DNA was subjected to PCR in the
presence of the following synthetic oligonucleotides:

V9 (5' GJVTCGGATCC(:?IX3GTGCT(3GTCTGTGTTCrGGI^ 3')
V10 (see example 4)

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was fused, by ligation into
the two BamHI sites, to the UBILacI fragment deriving from the
pMRS156 plasmid. The resulting fragment was cloned into the
pMRS30 expression vector. The resulting pMRS172 vector contains
a DNA fragment including the UBILacI sequence, the sequence
corresponding to nucleotides 1222-1497 of the EMBL sequence
J05581, and two stop codons, TGA and TAA. This fragment is
reported in fig. 10.

Example 10. Plasmid pMRS176 construction.

Plasmid pMRS167 DNA was subjected to PCR in the
presence of the following synthetic oligonucleotides:

V3 (see example 6)

V10 (see example 4)

The DNA fragment produced, purified and digested with the
restriction enzymes XbaI and BamHI, was fused, by ligation into
the two BamHI sites, to the UBILacI fragment deriving from the
pMRS156 plasmid. The resulting fragment was cloned into the
pMRS30 expression vector. The resulting pMRS176 vector contains
a DNA fragment including the UBILacI sequence, the sequence
corresponding to nucleotides 136-1497 of the EMBL sequence
J05581, and two stop codons, TGA and TAA. This fragment is
reported in fig. 11.

Exanple 11. Eukaryotic cell tranefection and testing for 
transcription . 

CHO (Chinese Hamster Ovary) cells were cultured in alpha 
MEM supplemented with ribonucleotides and deoxyribonucleotides 
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at transf ection time. 

Dendritic cells were obtained from CD34+ hemopoietic 
precursors cultured In IMDM vfithout serum, supplemented with GM- 
CSP, IL4, SCF, Flt3 and TUFalpha. After 7 days the obtained cell 
S population was transf ected. ^; 
Dendritic cells were obtained from monocytes isolated from 
PBMC {peripheral blood mononuclear cells} , cultiired in RPMI 
supplemented with PCS, GM-CSF, and IL-4. After 7 days the 
obtained cell population was transf ected, 
10 In each case, about one million cells were transfected with 

one of the plasmids reported in exaniples 1 to 10, Transf ection 
was carried out using 3 jig of plasmid DNA and 4 fil of DMRIE 
(Gibco) by lipofection. ;• 
After 24 hours cells were harvested, washed with PBS and 
15 lysed in order to extract the tnFSNA. 

An mRNA aliquot was subjected to an RT-PCR reaction in the 
presence of the oligonucleotide pair specific for the 
transfected DNA plasmid. 
This experiment was carried out for each plasmid reported 
in the examples 1 to 10, using the following oligonucleotide 
pairs: V11/V4 for pMRS166, V12/V6 for pMRS169, V13/V8 for 
pMRS168, V4/V10 for pMRS167, V4/V10 for pMRS175, UBIup/V4 for 
pMRS171, UBIup/V5 for pMRS174, UBIup/V8 for pMRS173, UBIup/V10 
for pMRS172, V14/V10 for pMRS176. 

As a representative example, figure 12 reports the 
electrophoretic analysis of the DNA fragments obtained by RT-PCR 
from the mRNA of the three cell populations, transfected with 
the pMRS169 plasmid. In this case the oligonucleotide pair 
V12/V6 was used. 

Example 12. In vivo study results. 

In the in vivo studies, the mixtures of the four fragments 
and the pMRS30 plasmid (vector without insert and thus used as a 
negative control) were used. In order to verify that 
immunization had occurred, an ELISA test was used to detect the 
human mucin specific antigens. 

The in vivo studies were conducted using human MUC1 
transgenic C57BL mice. As a consequence, in these animals the 
MUC1 protein represents a self-protein. The employed vaccination 
schedule consists of 3 intradermal administrations (at days 0, 
14 and 28) of 100 micrograms plasmid DNA each, given in the 
dorsal portion (50 micrograms DNA for each side). At day 14 
after the last administration, the animals were sacrificed and 
sera were tested for anti-human mucin antibodies. 
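
The arithmetic of the schedule above can be laid out as a short sketch. It is illustrative only: the variable names are hypothetical, and it assumes that "each side" means two dorsal injection sites per administration, which is what makes 50 micrograms per side add up to the stated 100 micrograms.

```python
# Illustrative sketch of the vaccination schedule described in the text.
# All names are hypothetical; the numbers are those given in Example 12.

ADMINISTRATION_DAYS = [0, 14, 28]  # three intradermal administrations
DOSE_PER_SIDE_UG = 50              # micrograms of DNA per dorsal side
SIDES = 2                          # assumption: two dorsal sides per dose
SACRIFICE_DELAY_DAYS = 14          # days after the last administration

dose_per_administration = DOSE_PER_SIDE_UG * SIDES
total_dose = dose_per_administration * len(ADMINISTRATION_DAYS)
sacrifice_day = ADMINISTRATION_DAYS[-1] + SACRIFICE_DELAY_DAYS

print(dose_per_administration, total_dose, sacrifice_day)
```

With these figures each animal receives 100 micrograms per administration, 300 micrograms of plasmid DNA in total, and sera are collected on day 42.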

The assayed fragment mixes, object of the present 
invention, stimulated a good immune response in the treated 
animals. 

On the other hand, vaccination experiments with a 
60-aminoacid peptide corresponding to the 20 aminoacids reported 
in fig. 2, from location 86 to location 105, repeated three 
times (this peptide is termed 3XTR), were also carried out. 
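
The composition of the 3XTR peptide can be sketched as follows. This is an illustration under a stated assumption: the 20-residue unit is taken as residues 86 to 105 as read from fig. 2, whose text is damaged in this copy, and the variable names are hypothetical.

```python
# Illustrative sketch: building the 3XTR peptide from its 20-residue unit.
# Assumption: the unit below is residues 86-105 as read from fig. 2
# (GlySerThrAlaPro ProAlaHisGlyVal ThrSerAlaProAsp ThrArgProAlaPro),
# written here in one-letter code.

TANDEM_REPEAT_UNIT = "GSTAPPAHGVTSAPDTRPAP"
REPEATS = 3

peptide_3xtr = TANDEM_REPEAT_UNIT * REPEATS
print(len(TANDEM_REPEAT_UNIT), len(peptide_3xtr))
```

Three copies of the 20-residue unit give the 60 aminoacids stated in the text.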

The two vaccinations differ in the type of the elicited 
antibody response. The antibody titer is much higher in the 
vaccination with 3XTR. Furthermore, the observed IgG subtypes 
are in favor of an essentially humoral (antibody) response in 
the case of vaccination with 3XTR, and of a cellular (cytotoxic) 
response in the case of vaccination with DNA. For anti-tumor 
therapy, a principally cytotoxic immune response is preferable. 
Because the experiments were carried out on transgenic mice, in 
which the human mucin is "self", we can foresee a similar 
response in humans. This response could justify the use, as DNA 
vaccines, of the compounds of the present invention in the 
treatment of MUC1 overexpressing human tumors. 
CLAIMS 

1. Pharmaceutical composition containing one or more DNA 
molecules, encoding fragments of a protein overexpressed in 
tumor cells in order to induce an antitumor Ag-specific immune 
response, in combination with suitable excipients and adjuvants. 

2. Pharmaceutical composition according to claim 1 wherein the 
overexpressed protein is MUC-1. 

3. Pharmaceutical composition according to claim 1 or 2 
containing at least two DNA molecules each containing a cDNA 
sequence encoding a Mucin fragment (MUC-1). 

4. Composition according to claim 3 containing at least three 
DNA molecules each containing a cDNA sequence encoding a Mucin 
fragment (MUC-1). 

5. Composition according to claim 4 containing at least four DNA 
molecules each containing a cDNA sequence encoding a Mucin 
fragment (MUC-1). 

6. Composition according to claims 3, 4 or 5 wherein the DNA 
sequences comprise about 200 to about 700 nucleotides, each 
sequence being contiguous and possibly partially overlapping, 
from about 50 to about 150 nucleotides at the 3' and/or 5' end, 
to the adjacent one. 

7. Pharmaceutical composition according to any claim from 2 to 6 
wherein the used mixture consists of, at least, two plasmid DNA 
molecules, each containing a DNA fragment selected from those 
whose sequences are described in figures 1, 2, 3, and 4. 

8. Pharmaceutical composition according to claim 7 wherein the 
used mixture consists of the pool of plasmid DNA molecules, 
where each molecule contains a DNA fragment selected from those 
whose sequences are described in figures 1, 2, 3, and 4. 

9. Pharmaceutical composition according to claim 1 or 2 wherein 
a plasmid DNA molecule containing the sequence described in 
figure 5 is used. 

10. Pharmaceutical composition according to claims 7, 8, or 9 
wherein the used plasmid DNA molecules derive from the fusion of 
the pMRS30 expression vector in Fig. 13 to each sequence 
described in figures 1, 2, 3, 4, 5. 

11. Pharmaceutical composition according to claims 2 to 5 
wherein the used sequences, corresponding to single fragments of 
the protein, are preceded at the 5' termini by the sequence 
described in Fig. 6 encoding ubiquitin and a LacI portion 
from Escherichia coli. 

12. Pharmaceutical composition according to claim 11 wherein the 
mixture consists of one or more sequences deriving from joining 
the pMRS30 expression vector, described in Fig. 13, to a DNA 
sequence selected from those described in figures 7, 8, 9, and 
10. 

13. Pharmaceutical composition according to claim 11 wherein the 
mixture consists of the totality of the sequences deriving from 
joining the pMRS30 expression vector to a DNA sequence selected 
from those described in figures 7, 8, 9, and 10. 

14. Pharmaceutical composition according to claim 11 wherein the 
mixture consists of a sequence deriving from joining the pMRS30 
expression vector to the sequence described in figure 11. 

15. Pharmaceutical composition according to any preceding 
claim, further containing a cytokine or a cytokine encoding 
plasmid. 

16. A plasmid DNA molecule consisting of the pMRS30 expression 
vector joined to a DNA sequence, encoding a MUC-1 protein 
fragment and whose sequence is selected from the group of those 
described in figures 1, 2, 3, 4, and 5. 

17. A DNA molecule encoding a protein MUC-1 fragment preceded in 
its 5' terminus by the sequence described in Fig. 6. 

18. A DNA molecule according to claim 17 selected from those 
described in figures 7, 8, 9, 10, and 11. 

19. A plasmid DNA molecule obtained by joining the pMRS 
expression vector to a DNA molecule selected from those of claim 
17 or 18. 
20. Use of DNA molecules of claims 16-19 in the preparation of a 
composition with anti-tumor effect. 
Figure 1 

  1 ATGACAGGTTCTGGTCATGCAAGCTCTACCCCAGGTGGAGAAAAG 
  1>MetThrGlySerGlyHisAlaSerSerThrProGlyGlyGluLys 
 46 GAGACTTCGGCTACGCAGAGAAGTTCAGTGCCCAGCTCTACTGAG 
 16>GluThrSerAlaThrGlnArgSerSerValProSerSerThrGlu 
 91 AAGAATGCTGTGAGTATGACCAGCAGCGTACTCTCCAGCCACAGC 
 31>LysAsnAlaValSerMetThrSerSerValLeuSerSerHisSer 
136 CCCGGTTCAGGCTCCTCCACCACTCAGGGACAGGATGTCACTCTG 
 46>ProGlySerGlySerSerThrThrGlnGlyGlnAspValThrLeu 
181 GCCCCGGCCACGGAACCAGCTTCAGGTTGATAA 
 61>AlaProAlaThrGluProAlaSerGly 
Figure 2 



1 ATGGTGCCCSGCTCTACTGAGi^GAATGCTGTGAGTATGACCAGC 
lUVtetVal ProSer SerThrGI uLysAsnAl aVal SerMetThr Ser" 

4 6 AGCGTACTCTCCAGCCACAGCCCCGGTTCAGGCTCCTCCACCACT 
le^- Ser Va I LeuSer Ser Hi sSer P roGl ySer G) ySer Ser Thr Thr 
9 1 CAGGGACAGGATGTCACTCTGGCCCCGGCCACGGAACCAGCTTCA 
31^G! nGI yGl nAspVa I Thr LeuAl aProAl aThrG! uProAl aSer 
13 6 GGTTCAGCTGCCACCTGGGGACAGGATGTCACCTCGGTCCCAGTC 
46 ► GI ySer Ai aAl aThr T rpGi yG I nAspVa l Th r Se r Va I P roVa i 
181 ACCAGGCCAGCCCTGGGCTCCACCACCCCGCCAGCCCACGATGTC 
61^ Thr ArgProAl aLeuGI ySer Thr Thr Prop roA! aHi sAspVa i 
226 ACCTCAGCCCCGGACAACAAGCCAGCCCCGGGAAGTACTGCTCCA 
ve^-ThrSerAl aProAspAsnLysProAf aProGl ySer Thr AI aPro 
271 CCAGCACACGGTGTTACCTCGGCTCCGGATACCAGGCCGGCCCCA 
91 ProAl aHi sGI yVal Thr Ser A[ aProAspThrA rgProAt aPro 
316 GGTAGTAGCGCCCCTCCTGCCCATGGTGTCACATCTGCCCCGGAC 
106> GI ySerThr AI aProProAl aHi sGI yVal ThrSer AI aProAsp 
361 AACAGGCCTGCATTGGGTAGTACAGC ACCGCCAGTACACAACGTT 
121>'AsnArgProAl aLeuGl ySerThr AI aProProVal Hi sAsnVal 
406 ACTAGTGCCTCAGGCTCTGCTAGCGGCTCAGCTTCTACTCTGGTG 
136^ Thr Ser At aSer GI ySer AI aSer GI ySer AI aSer Thr LeuVa I 
451 CACAACGGCACCTCTGCQCGCGCGACCACAACCCCAGCGAGCAAG 
151^ Hi sAsnGI yThr Ser AI aArgAI aTh r Thr Thr P roAl aSer Lys 
496 AGCACTCCATTCTCAATTCCCAGCTGATAA 
166^SerThrProPheSer I i eProSer* * • • • • 
Figure 3 



1 ATGGGCTCAGCTTCTACTCTGGTGCACAACGGCACCTCTGCCAGG 
l^MetGI ySerAI aSerThr LeuVal Hi sAsnGl yThr Ser Al aArg 
-46 GCTACCACAACCCCAGCCAGCAAGAGCACTCCATTCTCAATTCCC 
16^Ai aThrThrThr ProAl aSer LysSerThr ProPheSer 1 I ePro 
91 AGCCACCACTCTGATACTCCTACCACCCTTGCCAGCCATAGCACC 
31>- Ser Hf sH} sSerAspThr ProThr Thr LeuAI aSer HI sS&r Thr 
136 AAGACTGATGCCAGTAGCACTCACCATAGCACGGTACCTCCTCTC 
46^ LysThrAspAl aSer Ser Thr Hi sHf sSerThrVal ProProLeu 

181 acctck;tccaatcacagcacttctccccagttgtctactggggtc 
61> Thr Ser SerAsnHi sSer Thr Ser P roGJ nLeuSer Thr GJ yVa ! 

226 TCTTTCTTTTTCCTGTCTTTTC AC ATTTCAAACCTCCAGTTTAAT 
16>- Ser PhePhePheLeuSer PheHi s I 1 eSerAsnLeuGJ nPheAsn 
271 tcctctctggaagatgccagcaccgactactaccaagagctgcag 
91^ SerSer LeuGl uAspProSerThrAspTyrTyrGI nGI uLeuG! n 
33. 6 agagacatttctgaaatgtttttgcagatttataaacaagggggt 
lOe^A rgAsp I 1 eSer Gi uMst PheLeuG[ n I I eTy rLysGl nGI yGI y 
361 tttctgggcctctccaatattaagttcaggccaggatctgtggtg 
121^ PheLeuGlyLeuSerAsni f eLysPheArgProGlySer Val Va! 
406 GTACAATTGACTCTGGCCTTCCGAGAAGGTACX: ATCAATGTCCAC 
136^ ValGI nLeuThr LeuAI aPheArgGI uGi yThr tleAsnVal His 
4S1 gacgtggagacacagttcaatcagtataaaacggaagcagcctct 
ISl^AspVal GI uThr GI nPheAsnGI nTy rLysThr Gl uAI aAl aSer 
496 ggatataacctgacgatctcagacgtcagcgtgagtgatgtgcc A 

ISS^A rgTyrAsnLeuThr I ! eSerAspVa t Ser Va! SerAspVa I Pro 

541 tttcctttctctgcccagtctggggctggggtgcc aggctggggc 
181^ PheP roPheSer Al aGI nSer Gi yAt aG! yVal ProG! ylrpGI y 
5BpS atcgcgctgctggtgctggtctgtgttctggttgcgctggccatt 
196^ i leAlaLeuLeuVal LeuVal CysValLeuValAlaLeuAl a] le 
631 GTCTATCTCATT^GCCTTGTGATAA 
211>-ValTyrLeui leAlaLeu 
Figure 4 



1 ATGCTGGTGCTGGTCTGTGTTCTGGTTGCGCTGGCCATTGTCTAT 
l^MelLeuVal LeuVai CysVal LeuValAlaLeuAl al i eVal Tyr 
4 6 CTC ATTGCCTTGGCTGTCTGTCAGTGCCGCCGAAAGAACTACGGG 
16 >■ Leu I I eAl aLeuAl aVai CysGi nCysArgArgLysAsnTy rG[ y 
91 CAGCTGGACATCTTTCCAGCCCGGGATACCTACCATCCTATGAGC 
31>G\ nLeuAspi I ePheProAl aArgAspThrTyrHi sProMetSer 

13 6 GAGTACCCCACCTACCACACCCATGGGCGCTATGTGCCCCCTAGC 
46^GI uTyrProThrTyrHI sThrHi sGlyA rgTyrVal ProProSer 

181 AGTAOCGATCGTiiJSCCCCTA'raAGAAGGTTTCTGCT^GTi^^ 
Sl^SerThrAspArgSer ProTyrGI uLysValSerAlaGi yAsnGty 

226 GGCAGCAGGCrCTCTTACACAAACCCACK^AGTGGCAGCEJ^ 
76^ Gi ySer Ser LeuSerTy rThrAsnProAl aValAI aAl aThr Ser 

271 GCCAACTTGTGATAA 
91^AI aAsnLeii 
Figure 5 



1 ATGACAC3GTTCTGGTCATGCAAGCTCTACCCCAGGTGGAGAAAAG 
3>MetThrGI ySer Gl yHi sAI aSer Ser Thr ProGI yGJ yGI uLys 
46 GAGACTTCGGCTACCCAGAGAAGTTCAGTQCCCAGCTCTACTGAG 
16^GI uThrSef AlaThrGlnArgSerSerValProSerSerThrGI u 
91 AMSAATGCTGTGfiJGTATGACCAGCAGCGTACTCTCCAGCCACAGC 
■n> LysAsnAI aVa I SerMetThr Ser Ser Val LeuSerSer HI sSer 
136 CCCGGTTCAGGCTCCTCCACCACTCAGGGACAGGATGTCACTCTG 
■ "46K ProGI ySer Gt ySer SerThrThr GI nGi yGI nAspVa I Thr Leu 
1 81 GCCCCGGCCACGGAACCAGCTTCAGGTTCAGCTGCCACCTGGGGA 
ei*- Al aProAJ aThr Gl uProAl aSer Gi ySer Al aAl aThr TrpGI y 
226 CAGGATGTCACCTCGGTCCCAGTCACCAGGCCAGCCCTGGGCTCC 
76^GI nAspValThrSerVaJProVatThrAfgProAlaLeuGI ySer 
271 ACCACCCCGCCAGCCCACGATGTCACCTCAGCCCCGGACAACAAG 
giKThrThrProProAlaHisAspValThrSerAlaProAspAsnLys 
316 CCAGCCCCGGGAAGTACCGCTCCACCAGCACACGGTGTTACCTCG 
106> P ro Al aP roGI ySer Thr Al aP roProAl aHi sGI yVa ! Th r Ser 
361 GCTCCQGATACCAGGCCGGCCCCAGGTAGTACCGCCCCTCCTGCC 
121>AI aProAspThrA rgProAl aProGI ySerThr Al aProProAl a 
406 CATGGTGTCACATCTGCCCCGGACAACAGGCCTGCATTGGGTAGT 
136^ Hi sG! yVa f Thr Ser Al aProAspAsnArgProAl aLeuGI ySer 
451 ACAGCACCGCCAGTACACAACGTTACTAGTGCCTCAGGCTCTGCT 
151> Thr Al aP ro P roVa I Hi sAs n Va I Thr Ser Al aSer Gl ySer AJ a 
496 AGCGGCTCAGCTTCTACTCTGG.TGCACAACGGCACCTCTGCGCGC 
166> Ser GI ySer Ai aSer Thr LeuVa I Hi sAsnG! yThr Ser Al aArg. 
541 GCGACCACAAOCCCAGCGAGCAAGAGCACTCCATTCTCAATTCCC 
18l>-AlaThrThrThrProAlaSerLysSerThrProPheSer 1 J.ePro 
586 AGCCACCACTCTGATACrcCTACCACCCTTGCCAGCCATAGCACC 
196^ Ser Hi sHf s SerAspThr ProThr Thr LeuAl aSerHi s Ser Thr 
63 1 A^iGACTGATGCCAGTAGCACTCACCATAGCACGGTACCTCCTCTC 
21iy LysThrAspAlaSsr Ser Thr Hi sHi sSerThr Val ProProLeu 
67 6 ACCTCCTCCAATCACAGCACTTCTCCCCAGTTGTCTACTGGGGTC 
226^Thr SerS6rAsnHi sSerThr Ser ProGi nLeuSerThr Gl yVal 
721 TCTrTCTTTTTCCTGTCTTTTCACAlTTCAAACCTCCAGTTTAAT 
241^SerPhePhePheLeuSerPheHis I I eSerAsnLeuGi nPheAsn 
766 TCCTCTCTQGAAGATCCCAGCACCGAGTACTACCAAGAGCTGCAG 
2S6^ Ser Ser LeuGI uAspProSer ThrAspTy rTy rG! nGI uLeuGJ n 
811 AGAGACATTTCTGAAATGTTTTTGCAGATTTATAAACAAGGGGGT 
27lKArgAspl ! eSerGi uNtet PheLeuGi n 1 1 eTyrLysGl nG! yG! y 
856 TTTCTGGGCCTCTCCAATATTAAGTTCAGGCCAGGATCTGTGGTG 
286^ PheLeuGi yLeuSerAsn I i eLy sPheArgProG! ySer Va I Va i 

(Continued) 
Figure 5 (continued) 



901 GTACAATTGACTCTGGCCTTCCQAGAAGGTACCATCAATGTCCAC 
301^ Va! Gi nLeuThr LeuAl aPheArgGi uGl yThr 1 1 eAsnVal Hi s 
946 GACGTGGAGACACAGTTCAATCAGTATAAAACQGAAGCAGCCTCT 
•316^AspValGI uThrGI nPheAsnGI nTyrLysThr GI uAIaAJ aSer 
991 CGATATAACCTGACGATCTCAGACGTCAGCGTGAGTGATGTGCCA 
33lVArgTyrAsnLeuThr i I eSerAspVal Ser Val SerAspVai Pro 

103 6 TTTCCTTTCTCTGCCCAGTCTGGGGCTGGGGTGCCAQQCTGGGGC 
346> PheProPheSsr Al aGi nSer Gl yAl aGI yVai ProGt yTrpGI y 

1081 ATCGCGCTGCTGGTGCTGGTCTGTGTTCTGGTTGCGCTGGCCATT 
361^ 1 i eAl aLeuLeuVal LeuVal CysVal LeuVal Al aLeuAl al I e 

112 6 GTCTATCTCATTGCCTTGGCTGTCTGTCAGTGCCQGCGAAJ^AAC 
376> Val Ty rLeu I I eAraLeuAl aVa ! CysGi nCys Arg ArgLysAsn 
. 1171 TACGGGCAGCTGGACATCTTTCCAGCCCGGGATACCTACCATCCT 
391>TyrGI >i3lnLeuAsp! lePheProAl aArgAspThrTyrHi sPro 

121'6 ATGAGCGAGTACCCCACCTACCACACCGATGGGCGCTATGTGCCC 
406>MetSer Gl uTyrProThrTyrHi sThr.Hi sGl yA rgTyrVal Pro 

12 6 1 CCTAGCAGTACCGATCGTAGCCCCTATGAGAAGGT-TTC1GCAGGT 
421^ ProSer Ser ThrAspArgSerProTyrGI uLysVal Ser Al aG! y 

1306 AATGGTGGCAGCAGCCTCTCTTACACAAAecay^CAGTGGCAGCC 
436^ AsnGl yGi ySsr Ser LeuSer Ty rThrAsnProAl aValAI aAl a 

1351 ACTTCTGCCAACTTGTGATAA 
451>' Thr Ser Al aAsnLeu* i • • • • 
Figure 6 

  1 ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC 
  1>MetGlnIlePheValLysThrLeuThrGlyLysThrIleThrLeu 
 46 GAAGTGGAGCCGAGTGACACCATTGAGAATGTCAAGGCAAAGATC 
 16>GluValGluProSerAspThrIleGluAsnValLysAlaLysIle 
 91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 
 31>GlnAspLysGluGlyIleProProAspGlnGlnArgLeuIlePhe 
136 GCAGGCAAGCAGCTGGAAGATGGCCGCACTCTTTCTGACTACAAC 
 46>AlaGlyLysGlnLeuGluAspGlyArgThrLeuSerAspTyrAsn 
181 ATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 
 61>IleGlnLysGluSerThrLeuHisLeuValLeuArgLeuArgGly 
226 GGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 
 76>GlyArgHisGlySerGlyAlaTrpLeuLeuProValSerLeuVal 
271 AAAAGAAAAACCACCCTGGCGCCCAATACGCAAACCGCCTCTCCC 
 91>LysArgLysThrThrLeuAlaProAsnThrGlnThrAlaSerPro 
316 CGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCC 
106>ArgAlaLeuAlaAspSerLeuMetGlnLeuAlaArgGlnValSer 
361 CGAGGATCC 
121>ArgGlySer 
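
The pairing of nucleotide and aminoacid lines in these figures can be checked with a short translation routine. The sketch below is illustrative only: it applies the standard genetic code to the first line of fig. 6, and the function and constant names are hypothetical.

```python
# Illustrative sketch: translating the opening line of the fig. 6 sequence
# with the standard genetic code. Names are hypothetical.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete final codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


FIG6_FIRST_LINE = "ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC"
print(translate(FIG6_FIRST_LINE))
```

The 45 nucleotides translate to MQIFVKTLTGKTITL, the one-letter form of the first aminoacid line of the figure, and the same routine renders the TGA and TAA codons that close the other constructs as stops.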
Figure 7 

  1 ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC 
  1>MetGlnIlePheValLysThrLeuThrGlyLysThrIleThrLeu 
 46 GAAGTGGAGCCGAGTGACACCATTGAGAATGTCAAGGCAAAGATC 
 16>GluValGluProSerAspThrIleGluAsnValLysAlaLysIle 
 91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 
 31>GlnAspLysGluGlyIleProProAspGlnGlnArgLeuIlePhe 
136 GCAGGCAAGCAGCTGGAAGATGGCCGCACTCTTTCTGACTACAAC 
 46>AlaGlyLysGlnLeuGluAspGlyArgThrLeuSerAspTyrAsn 
181 ATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 
 61>IleGlnLysGluSerThrLeuHisLeuValLeuArgLeuArgGly 
226 GGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 
 76>GlyArgHisGlySerGlyAlaTrpLeuLeuProValSerLeuVal 
271 AAAAGAAAAACCACCCTGGCGCCCAATACGCAAACCGCCTCTCCC 
 91>LysArgLysThrThrLeuAlaProAsnThrGlnThrAlaSerPro 
316 CGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCC 
106>ArgAlaLeuAlaAspSerLeuMetGlnLeuAlaArgGlnValSer 
361 CGAGGATCCACAGGTTCTGGTCATGCAAGCTCTACCCCAGGTGGA 
121>ArgGlySerThrGlySerGlyHisAlaSerSerThrProGlyGly 
406 GAAAAGGAGACTTCGGCTACCCAGAGAAGTTCAGTGCCCAGCTCT 
136>GluLysGluThrSerAlaThrGlnArgSerSerValProSerSer 
451 ACTGAGAAGAATGCTGTGAGTATGACCAGCAGCGTACTCTCCAGC 
151>ThrGluLysAsnAlaValSerMetThrSerSerValLeuSerSer 
496 CACAGCCCCGGTTCAGGCTCCTCCACCACTCAGGGACAGGATGTC 
166>HisSerProGlySerGlySerSerThrThrGlnGlyGlnAspVal 
541 ACTCTGGCCCCGGCCACGGAACCAGCTTCAGGTTGATAA 
181>ThrLeuAlaProAlaThrGluProAlaSerGly 
Figure 8 

■ 1 ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC 
l^MetGi nl 1 ePheVal LysThr LeuThrGi yLysThr I I eThr Leu 

45 GAAGTGGAGCCGAGTQACACCATTGAGAATGTCAAGGCAAAGATC 
le^GI uVai Gi uProSerAspThr I I eGI uAsnVal LysAJ aLys J ! e 
91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 
Bl^GJ nAspLysGl uGlyl I eProProAspGI nGl nArgLeul lePhe 

46^AiaGlyLysGI nieuGI uAspGi yA rgThr UeuSerAspTyrAsn 
181 ATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 

61^ I I eGI nLysGI uSerThr LeuHj sLeuVal LeuArgLeuArgGI y 
226 QGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 

76> GI yA rgHi sGf ySer Ql yAl aT rpLeuLeuProVal Ser LeuVal 
271 AAAAGAAAAACCACCCTGGCGCCCAATACGC AAACCGCCTCTGGC 

■ 91>'LysArgLysThrThr LeuAl aProAsnThr GI nThr AI aSer Pro 
316 CGCGCGTTGGCOGATTCATTAATGCAGCTGGCACGACi^TTTCC 
106 ►a rgAI aLeuAt aAspSer LeuMetGI nLeuAl aArgGi nVal Ser 
361 CGAGGATCCGTGCCCAGCTCTACTGAGAAGAATGCTGTGAGTATG 
121>A rgGI ySer Va! ProSerSerThr GI uLysAsnAi aValSsrMst 
406 ACCAGCAGCGTACTCTCCAGCCACAGCCCCGGTTCAGGCTCCTCC 
136>ThrSerSer Val LeuSer Ser Hi sSerProGI ySerGI ySerSer 
45 1 ACCACTCAGGGACAGGATGTCACTCTGGCCCCGGCCACGGAACCA 
151>-ThrThr GI nGl yGI nAspVal Thr LeuAl aProAl aThr GI uPro 
496 GCTTCAGGTTCAGCTGCCACCTGGGGAC AGGATGTCACCTCGGTC 
166^AI aSerGI ySer Af aAf aThrTrpGI yGI nAspVal Tiir Ser Val 
54 1 CCAG'rCACCAGGCCAGCCCTGGGCTCCACCACCCCGCCAGCCCAC 
181^ P roVa I Thr A rgProAl aLeuGI ySer Thr Thr ProProAl aHi s 
.586 GATGTCACCTCAGCCCCGGACAACAAGCCAGCCCCGGGAAGTACT 
ISe^AspVa I Thr Ser Al aP roAspAsnLysProAI aProGI ySer Thr 
631 GCTCCACCAGCACACQGTGTTACCTCGGCTCCGGATACCAGGCCG 
211^A1 aProProAlaHl sGl yVaiThrSer Al aProAspThr A rgPro 
67 6 GCCCCAGGTAGTACCGCCCCTCCTGCCCATGGTGTCACATCTGCC 
226^AI aProGI ySerThr Al aProProAl aHi sGf yVal Thr Ser Al a 
72 1 CCGGACAACAGGCCTGCATTGGGTAGTACAGCACCGCCAGTACAC 
241^ ProAspAsnArgProAl aLeuGI ySerThr Ai aProProVal Hi s 
766 AACGTTACTAGTGCCTCAGGCTCTGCTAGCGGCTC AGCTTCTACT 
256>'AsnVa I Thr Ser Al aSer GI ySer Al aSer GI ySer Al aSerT)ir 
811 CTGGTGCACAJlCGGaACCTCTGCGCGCGCGACCACAACCCCAGCG 
271> LeuVal HI sAsnGI yThr Ser Al aArgAI aThr Thr Thr ProAl a 

856 agcaagagcactccattctcaattcccagctga™ 
286^SerLysSerThrProPheSer tleProSerf . 
Figure 9 

1 Ara::AGATCTTCGTGAAGACCCTGACTGGTAAGJW:CATCACTCTC 
!► Me t Ql nl I ePheVa I Ly sThr LeuThr Gf yLysThr r i eThr Leu 
46 GAAGTGGAGCCGAGTGACfiCCATTGAGAATGTCAAGGCAAAGATC 
16V Gi uVa 1 G! uProSerAspThr i i eGl UAsnVal LysAI aLys 1 1 e 
91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 
31V GI nAspLy sGl uGl y 1 1 eP roP roAspG! nGI n ArgLeu 1 I ePhe 
13 5 GCAGGCAAGCAGCTGGAAGATGGCCGCACTCTtTCTGACTACAAC 
46V Al aGI yLysGl nLeuGI uAspGl yA rgThr LeuSerAspTyrAsn 
181 ATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 
6lV i I eGi nLysGI uSerThrLeuHi sLeuVat LeuArgLeuArgGI y 
226 GGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 
76V GI yA rgHl sGI ySer Gi yAl aTrpLeuLeuProVal Ssr LeuVai 
271 AAAAGAAAAACCACCCTCGCGCCCAATACGCAAACCGCCTCTCCC 
91V LysArgLysThrThr LeuAl aProAsnThr Gi nThr Al aSer Pro 
316 CGCGCGTTGGCCGATTCATTAATGC A6CTGGCACGACAGGTTTCC 
lOSV A rgAI aLeuAi aAspSer LeuNte t GI nLeuA! aArgGI nVal Ser 
361 CGAGGATCCGGCTCAGCTTCTACTCTGGTGCACAACQGCACCTCT 
121V A rgGl ySer G! ySer Al aSer Thr LeuVa i Hi sAsnGI yThr Ser 
406 GCCAGGGCTACCACAAGCCCAGGCAGCAAGAGCACTCCATTCTCA 
136VA! aArgAi aThrThrThr ProAI aSer LysSerThr ProPtieSer 
451 ATTCCCAGCCACCACTCTGATACTCCTACCACCCTTGCCAGCCAT 
151V I ! eProSer Hi sHisSerAspTlirProTtirThrLeuAiaSerHts 
496 AGCACCA.z^GAGTGATGCCAGTAGCACTCACCATAGCACGGTACCT 
166VSerThrLysTtirAspAlaSerSerThrHisHi sSerThr Val Pro 
541 CCTCTCACCTCCTCCAATCACAGCACTTCTCCCCAGTTGTCTACT 
18lV ProLeuThr Ser SarAsnHi sSer Thr Ser ProG! nLeuSer Thr 
5S6 GGGGTCTCTTTCTTTTTCCTGTCTTTTCACATTTCAAACCTCCAG 
• 196V Gi yVai Ser PhePhePlieLeuSer PheHi s I t eSerAsnLeuGI n 
631 TTTAATTCCTCTCTGGAAGATCCCAGCACCGACTACTACCAAGAG 
211V PheAsnSer Set LeuGI uAspProSerThrAspTyrTyrGI nGi u 
676 CTGCAGAGAGACATTTCTGAAATGTTTTTGCAGATTTATAAACAA 
226V LeuGI nArgAsp 1 1 eSerGI u^/tet PheLeuGl n 1 1 eTy rLysGl n 
721 GGGGGTTTTCTGGGCCTCTCCA-ATATTAAGTTCAGGCCAGGATCT 
241V G! yG! yPheLeuGI yLeuSerAsn [ I eLysPheArgProGl ySer 
766 GTGGTGGTACAATTGACTCTGGCCTTCCGAGAASGTACCATCAAT 
256V Va! VatValGlnLeuThrLeuA! aPheArgGluGiyThr 1 1 eAsn 
,811 GTCCACGACGTGGAQACACAGTTO^TCAGTATAAAACGGAAGGA 
27lVVal Hi sAspVai Gf uThrGi nPheAsnGI nTyrLysThrGtuAi a 
856 GCCTCTCGATATAACCTGACGATCTCAGACGTCAGCGTGAGTGAT 
286VA1 aSerArgTyrAsnLeuThr 1 1 eSerAspVal Ser Val SerAsp 
901 GTCCC ATTTCCTTTCTCTGCCC AGTCTGGGGCTGGGGTGCCAGGC 
30lV Val ProPheProPheSerAl aGi nSerGi yAl aGI yValProGly 
946 TGGGGCATCGCGCTGCTGGTGCTGGTCTGTGTTCTGGTTGCGCTG 
316VTrpGl y 1 1 eAi aLeuLeuVat LeuVal CysVal LeuValAl aLeu 
991 GCCATTGTCTATCTCATTGCCTTGTGATAA 
331V Al a! I .eVaf TyrLeu I I eAl aLeu* *••.•• 
Figure 10 



1 ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC 
l^MetG! nl lePheValLysThrLeuThrGlyLysThr I leThrLeu 

46 GAAGTGGAGCCGAGTGACACCATTGAGAATGTCAAGGCAAAGATC 

IS^Gi uVal Gl uProSerAspThr 1 1 eGl uAsnVal LysAI aLys | le 
■ 91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 

ai^G! nAspLysGi uGly i I eProProAspGi nGl nArgLeuJ t ePhe 
13 6 GCAGGCAAGCAGCTGGAAGATGGCCGCACTCTTTCTGACTACAAC 

46>'AI aGI yLysGt nLeuGI uAspGi yA rgThr LeuSerAspTyrAsn 
181 ATCC AG AAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 

61^ i I eGl nLysG! uSerThr LeuHi sLeuVal LeuArgLeuArgGI y 
226 GGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 

76>-G! yA rgHi sGI ySer G! yAl aT rpLeuUeuProVal Ser LeuVa! 
271 AAAAGAAAAACCACCCTGGCGCCCAATACGC AAACCGCCTCTCCC 

91> LysArgLysThrThr LeuAl aProAsnThr Gi nThr A! aSer Pro 
316 CGCGCGTTGGCCGATTCATTAATGC AGCTGGCACGACAGGTTTCC 
106>A rgAlaLeuAlaAspSerLeuMetGI nLeuAlaArgGI nVal Ser 
361 CGAGGATCCCTGGTGCTGGTCTGTGTTCTQGTTGCGCTGGCCATT 
■i21^A rgGI ySerLeuVal LeuVal CysVal LeuVal A! aLeuAl al I e 
■ 4 0 6 GTCTATCTCATTGCCTTGGCTGTCTGTCAGTGCCGCCGAAAGAAC 
136> ValTyrLeul 1 eAI aLeuAI aVal CysGl nCysArg ArgLysAsn 
451 TACGGGCAGCTa3ACATCTTTCCAGCCCGGGATACCTACCATCCT 
IBl^TyrGI yGl nLeuAsp l.I ePheProAl aArgAspThr TyrHi sPro 
496 AOXS-AGCGAGTACCCC ACCTACCACACCCATGGGGGCTATGTGCCC 
l66VMet Ser Gl uTyrProThr TyrHi sThr Hi sGI yA rgTyrVa j P ro 
541 CCTAGCAGTACOGATCGTAGCCCCTATGAGAAGGTTTCTGCAGGT 
181^P roSerSerThrAspArgSer ProTyrG! uLysVal Ser Al aQI y 
586' AATGGTGGCAfiCAGCCTCTCTTACACAAACCCAGCAGTGGCAGCC 
196^AsnGI yGl ySer Ser LeuSer Ty rThrAsnProAl aValAI aAl a 
631 ACTTCTGCCAACTTGTGATAA 
211>.ThrSer A! aAsnLeu 
Figure 11 

1 ATGCAGATCTTCGTGAAGACCCTGACTGGTAAGACCATCACTCTC 
l^Met Gl nl I ePheVal LysThr LeuThr Gt yLysThr 1 1 eThr Leu 

46 GAAGTGGAGCCGAGTGACACCATTGAGAATGTCAAGGCAAAGATC 
16>GI uVal Gi uProSerAspThr I ! eGl uAsnValLysAI aLys I le 
91 CAAGACAAGGAAGGCATCCCTCCTGACCAGCAGAGGCTCATCTTT 
31^G! nAspLysGI uGt y i ! eProProAspGl nGl nArgLeul l ePhe 
13 6 GCAGGGAAGCAGCTGGAAGATGGCCGCACTCTTTCTGACTACAAC 
46^A1 aG! yLysGi nLeuGi uAspGl yA rgThr LeuSerAspTyrAsn 
181 ATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGT 
6l> I leGlnLysGluSerThrLeuHisLeuVal LeuArgLeuArgGI y 
226 GGGAGGCACGGTAGTGGTGCATGGCTGTTGCCCGTCTCGCTGGTG 
76 ► Gl yA rgHi sGl ySer Gl yAt aT rpLeuLeuProVal Ser LeuVa I 
271 AAAAGAAAAACCACCCTGGCGCCCAATACGCAAACCGCCTCTCCC 
91^ LysArgLysThrThrLeuAl aPr.oAsnThrGI nThr Ai aSer P ro 
316 CGCGCGTTGGCCGATTC ATTAATGC AGCTGGCACGACAGGTTTCC 
106> A rgAI aLeuAl aAspSer Leu^/tetGI nLeuAi aArgGl nVal Ser 
361 CC-AGGATCCACAGGTTCTGGTCATGCAAGCTCTACCCCAGGTGGA 
■l21>'ArgGI ySer Thr Gt ySer Gi yHi sAI aSer Ser Thr P roGI yGl y 
405 GAAAAGGAGACTTCGGCTACCCAGAGAAGTTCAGTGCCCAGCTCT 
136>GI uLysGI uThr Ser AI aThr Gi nArgSer Ser Va l ProSerSer 
451 ACTGAGAAGAATGCTGTGAGTATGACCAGCAGCGTACTCTCCAGC 
151^ Th r Gl uLys As n AI aVa I Ser t Th r Se r Se r Va i Le u Se r Ser 
496 CACAGCCCCGGTTCAGQCTCCTCCACCACTCAGGGACAGGATGTC 
1 66 ► H i s Se r P ro Gl ySe r Gl ySe r Se r Th r Th 1" Gl rr 6! yG i n As pVa I 
541 ACTCTGGCCCCGGCCACGGAACCAGCTTCAGGTTCAGCTGCCACC 
181^ Thr LeuA! aProAJ aThr Gl.uP roAl aSer GI ySer AI aAl ^Thr 
586 TGGGGACAGGATGTCACCTCGGTCCCAGTCACCAGGCCAQCCCTG 
196>TrpGI yG! nAspVal Thr Ser Val ProVal Thr ArgProAl aLeu 
631 GGCTCCACCACCCCGCCAGCCC ACGATGTC ACCTCAGCCCCGGAC 
2li^ Gl ySer Thr Thr ProP roAi aHi sAsp Val Thr Ser AI aP roAsp 
■676 AACAAGCCAGCCCCGGGAAGTACCGCTGCACCAGCACACQGTGTT 
226> AsnLysProAl aProG! ySer Thr AI aProProAt aHi sGI yVal 
721 ACCTCGGCTCCGGATACCAGGCCGGCCCCAGGTAGTACCGCCCCT 
241^ThrSer AI aProAspThr ArgP roAl aProGI ySerThr AI aPro 
766 CCTGCCCATGGTGTCACATCTGCCCCGGACAACAGGCCTGCATTG 
256^ ProAl aHi sGI yValThr Ser AI aProAspAsnArgProAl aLeu 
■ 811 GGTAGTACAGCACCGCCAGTACACAACGTTACTAGTGCCTCAGGC 
271^ G! ySer Thr A! aP roP roVa I HI sAsnVa 1 Thr Ser Ai aSer Gl y 
856 TCTGCTAGCGGCTCAGCTTCTACTCTGGTGCACAACGGCACCTCT 
■286> Ser AI ^Ser Gi ySer AI aSer Thr LeuVa I HI sAsnGl yThr Ser 

(Continued) 
Figure 11 (continued) 

901 GCGCGCGCGACCACAACCCCAGCGAGCAAGAGCACTCCATTCTC A 

301>A!aArgAI aThrThrThrProAi aSer LysSerThrProPheSer 

946 ATTCCCAGCCACCAOTCTGATACTCCT ACCACCCTTOZCAGCCAT 

316V 1 leProSer Hi sHi sSerAspThr ProThrThr LeuAl aSer Hi s 

991 AGCACCAAGACTGATGCCAGTAGCACTCACCATAGCACGCTACCT 

331> SerThrLysThrAspAlaSerSerThrHisHisSerThr Val Pro 
1036 CCTCTCACCTCCTCCAATCACAGCACTTCTCCCCAGTTGTCTACT 

346V ProLeuThr Ser SerAsnHf sSerThr Ser ProGI nLeuSerThr 
1081 GGGGTCTCTTTCTTTTTCCTGTCTT TTCACATTTCAAACCTCC AG 

361V GI yVal Ser PhePhePheleuSer PheHt s 1 1 sSerAsnLeuGI n 
112 6 TTTAATTCCTCTCTGGAAGATCCCAGCACCGACTACTACCAAGAG 
■ 376V PheAsnSer Ser LeuGI uAspP roSer ThrAspTy rTy rGI nGi u 
117 1 CTGCAGAGAGACATTTCTGAAATGTTTTTGCAGATTTATAAACAA 

39lV LeuGJ nArgAspi leSerGI uMet PheLeuGI n I I eTyrLysGI n 
'12 1 6 GGGGGTTTTCTGGGCCTCTCCAATATTAAGTTCAGGCCAGGATCT 

. 406V Gl yGf yPheLeuG! yLeuSerAsn I I eLysPheArg ProGI ySer 
■ 1261 • GTGGTGGTACAATTGACTCTGGCCTTCCGAGAAGGTACCATCAAT 

421V Val Val Val Gl nLeuThr LeuAl aPheArgGI uGf yThr ! I eAsn 
1306 GTCCACGACGTGGAGACACAGTTCAATCAGTATAAAACGGAAGCA 

436V Va I HI sAspVal Gl uThr Gl nPheAsnG! nTy rLy sThr Gl uAt a 
1351 GCCTCTCGATATAACCTGACGATCTCAGACGTCAGCGTGAGTGAT 

451VAI aSer ArgTyrAsnLeuThr I I eSerAspVal Ser Va I SerAsp 
1396 GTGCCaTTTCCTTTCTCTGCCCAGTCTGGGGCTGGGGTGCC^^^ 

466V Val ProPheProPheSerAI aGI nSer Gl yAl aG! yVal ProGI y 
1441 TGGGGCAT03CGCTGCTGGTGCTGGTCTGTGTTCTGGTTX^^^ 

48lVTrpGlyl I eAI aLeuLeuVa I LeuVa t CysVat LeuVa I A I aLeu 
1486 GCC ATTGTCT ATCTCATTGCCTTGGCTGTCTGTCAGTGCCGCCGA 

.49'6VAI'at i eVal TyrLeul 1 eAI aLeuAl aVai CysGl nCysArgArg 
1531 AAGAACTACGGGCAGCTGGAC ATCTTTCCAGCCCGGGATACCTAC 

511V LysAsnTy rGI yGl nLeuAspl lePheProAl aArgAspThr Tyr 
1576 CATCCTATGAGCGAGTACCCCACCTACCACACCCATGGGCGCTAT 

526V Hi sProMet Ser Gl uTyrProThr TyrHt sThr Hi sGJ yA rgTyr 
1621 GTGCCCCCTAGCAGTACCGATCGTAGCCCCTATGAGAAGGTTTCT 

541V Val ProProSerSerThrAspArgSer ProTyrGl uLysVal Ser 
1666 GCAGGTAATGGTGGCAGC AGCCTCTCTTACACAAACCCAGC AGTG 

556VAIaGl yAsnGIyGIySerSer LeuSerTyrThrAsnProAl aVal 
1711 GCAGCCACTTCTGCCAACTTGTGATAA 

571V Al aAl aThr Ser A^aAsnLeu 
Figure 13 
1 CCAC 

51 GGACATTCCftATCaTAQQCTGCCCftTCCSiCCCTC^raTGTlXTCCTGra^ 
101 TTAGGTCACrrMCAAAMGSAMTTGQGTAGGGGTTTO^ 
151 TTTCm0(MTMTTTTAAAArATCTGGGAAGTCCCTTCCaC3?GC5X5^ 
201 TCCAGAAGTOTTGGTmCAGOCCAaiaATGTCyiACAG^ 
251 AGCTGTCAOTrJKSCaiCAAGGGCOCAACACCCTGCTGfta^ 
' 301 GTGGTTGCTS^GTTAGTAA^a^^3CaAAftCAGG^^^ 
351 TGTGTAOJTTCCAAAATATCTACTfflETTTCATTTSTACTI^TCaGG^ 
401 CCCWGCAOECCACTGGATAAGCATTATCCTTATOCAAAACAGCOTG^ 
451 TCACMGTTCaTCTGCTGACTGTCaACl^STAGCAarajrTGGGGTTACAGT 
501 TTGAGCAQGAmTTGGTCcnSTAGrTTGCTAACACACCCTGC^^ 
551 AAGGTTOCXEACCSACAGCytAAAAflMXJAAAATTrQACrc 
601 TTT(XACCh<XkTmCk'EGk<^^ 
. 651 CaTAGCAffiTTACCCCAATAACCTCAGTTTTAACAGTAACAGCTTOCCACA 
701 TCAAAATATTTCCACAGGTTAAGTCCTCATTrAAATTAGGCAAAG^ 
7.51 CTT<mGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTC 
801 ATGATAATAATGGTrPOTAGACGTCAGGIGGCACTTTTOGGGGAA&TGT 
. 851 GCGCGGAACCCCIATrTGTTTATTTITCTAAATACMTCAAArATGTATC 
901 CGCTCATGaiGACAATAACCCTGATAAATGCTTCAATAATATSGAAAAAGG 
951 AAGAGTATGAGTATTCaACATTTCCGTGTCGCCCTTATTCCCTTTTTTGC 
1001 GGCATTTTGCCTICKTGTTTTTGCTCACCCAGAAACGCTGGa^^ 
Figure 13 (continued) 

2151 TAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTOGG 

.2201 TCTGCTAATCCTGT5Aa:AGTGGCTGCTGCCAGTGQCGATMGTCGTGTC 

2251 mCCGGGTTSGACTCaAGRCGATAGmCCGGft.TAAGGCQCAG0GGTC3S 

2301 GGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTA 

2351CACCGAACTGAGAiyVCCTACAGa5TGAGCTATGAGAAAa;GCCaCGCTTC 

2401 CCGAAG(3GA6mGGCGaACAGGTAn:ca3GTAAGC(3aC7^^^ 

2451 GGAGAGOGCAOaGGGftQCITCCAGGGGGAAACGCCTGGTATCTTTATAG 

2501 TCCT0TCX3(3OTTC<MXyiCCTCTGACTTGAGCGTraAlTTITG!^^ 

2551 CG/SCkQSGS3GGC&^GCXrEfill<33htA^^ 

2601 CGGTTCCTGQCCTTTTGCKsGCCTTTTGCTC^ 

2551 ATClXCTGATTCfGTGGATAAGCCTATTACCGCCTlTCAGTGAGCnX^ 

■2701 CCGCTOGCCQaMKCGAACGACOGA<3a3C»GCGAGTCAGTGA<^ 

2751 GCGGAAGAG03CCTQU\TGCGGTATTTTCIHXmCGCATCirai?^^ 

2801 TTCACACCKCATATGG!W3CACTCTCaSTACAATCTGCTCT 

2851 AGTTAA<KCAGTATACaiATCaATATTQGCCATTAQCCATATTAlTCATTG 

2901GT'rATATAGCATAAATCAATM!T<3GCTATTGGCCATTGCATACGTT«3'AT 

2951 CCATATCATAATATGTACATTTATAIETGGCTCATGTCCAACATIAOOGCC 

3001 ATGTTGACATlX3ATTATO3ACTAGTTATTAA!PASrAATCaATTAC^^ 

3051 CATTAGTTCaiTAGCCCATATATOGAGTTCCGCGTirACATAACTTACGerA 

3101 AATGGCCCKCCTGa;TGACCGCCCAACGACCCCrGCCCATTGACBTCAAr 

3151 AATGACGTATGTTCOCATAGTAACGCCAATAGGGACTTTCCATTGACQTC 

3201 AATGGGTGGAGTATTTAOGGTAAAClX3CCCACTT<33CAGTACATCyUVGT6 

3251 TATmTATGCCAAGTAOXCrcmTTGACGTCAATGACGSia^^ 

(Continued) 
Figure 13 (continued) 

. 3301 CGCCIGGCKSTkfGCCCmrkCk^ 
3 351 AGTACATCTACGTATTASTCATCGCTATTACCATGGTQATGOQGrTTTGG 
3401 cy^GTACATCAATGGGCGTGSATAGCGGTTTGACTCACQQSGATTra^ 
3451 ICrCCACCXXyiTTGACGTCaATGGGASraTGTTra 
3501 GGSACTTTCCAAAATGTCGTAACAACTCra^CCCATTS^ 
3551 GGTAGGaSTGTACGGTGQGAGGTCTATATAAGClAGftGCTCQTra 
3601 C03ICAQM?CGCCTBGaGACGaaiTCCAC<KnrQTm6RCCTC^ 
3651 GACACCGGGACCGAICrAGCCiaiGCGGCCGGGAACGGTGCATTQGA^ 
3701 C^ATICCCCGTGCCAAGAAAQCrTGTCTAGAACX;CGGSAGAGCTCCT^ 

. 3751 QPAC^TCkQGGTGhC^rSlQGQSk^ 
3801 TTGTAAAATTCATGTTATATGGAGGGGGCAAAGTTiPTCAGGGTGTTGTTT 
385lAGAArGa3AAGATGTCXX:TTGrATCACCATGGACCCTCATGATAATTTTG 
3901 TTTCTTTCACTTTCTACrCICTTGACAACCAirTGSX^PCCTCmTrT^ 
3951 TTTCATETTCTGTAACTTTTTCGTTAAACIP'rTAGCTTGCATTTGTAACeA 
4001 ATlTrTAAATTCACTTTTeriTATTTGTCAGATTGrAAQTACrTTCrCTA 
405iATCACTTOTTTTCMQGCAATCAGGGrATATTATA'rTGTACTTCAGCAC 
4101 AGTTTTAGAGAACAATTGTTArAATTAAATGATAAGGTAGAATATTTCTG 
41 51 CATATAAATTCTGGCTGGCGTGGAAATATTCTTATTGGTAGAAACAACTA 
4201CATCCTGGTCATCATCCTGCCTTTCTCTTTATGGTTACAATGATATACAC 

4301 AACCATGTTCATGCCTTOT'CTTTTTCCTACAGCrCCTQOSCAACGTGCT 

4351 GGTrGTTGTGCTGTCTCATCATTTTGGCAAAGAATTCACTCCTCAGGTGC 

4401 AGGCTGCCTATCAGAAGGTQGTGSCTGGTGTGGCCAATGCCX^^ 

(Continued) 
Figure 13 (continued) 

1051 MGATGCTGAAGATCAOTTQGaraCACGAGTGG^ 
llOlCTCAAGJWKffiGTMGATCCTTGAGAGTTTTCSCCCnSMGAACem 
1151 MTGATGACCACTTTTAMGTTCTSCTAT(3TGGa3CK^ 
1201 TSGkOSCOSGGCRkGAGCKkC^CG^ 

1251 SACTXajKGftGTACTGACCACTO^CAGAAMGCATmAC^JATQGCaT 
1301, GACAGTAAGAGAATl!ATGCAGTGCrGa:^lTAACCATGAGT6ATAA^^ 
1351 CGXCMCTrflCTTCTGACMCGfl3N:a3AGSACC3{3AAGS^ 
1401 TTOTGCaCAACftTG<KGGATaATOTAACTCGCCTTGATCGTTGQGAACC 
1451 GGAGCTQAATOa\AGCCATACCAAAOGACGAGCGTGACA(X^CX^^ 
1501 CAGCAAIGGCAACAACGTTGaXMACTATTAACTGGaSAACTACT^ 
1551 CTAGCXTCCOGGCAACAATTAftlAGRCTGGATGSAGGCGGATAflACTTGC 
1601 AGGACCa^CTTCTGCGCTCGGCCCTTCOSGCTQGCTGGTmTTO^ 

ITOlCCAGATGGTAAGCCCTCCCCTATCGTAGmTCTACACGACGGGGAGTCA 
. 1751 GGCAACTATGGATSAACGAAATAGACAGATC6CTGAGATAGOTGCCTCAC 
1801 TGATTAAGCATT(3JTAACTGTCA6ACCAAGTTTACTCATA1:ATACITTAG 

1851 attgatttaaaacttcaxttttaatttaaaaggatctaggtgaagatc^ 

1901 ttttgatmtctcatgaccaaaatcccttaacgtgagttttcgttccact 
19 51 gagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatccttst 

2001TTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAQC 

2051 GCTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAA 
2101C!EGGCTICAQCaGAGO3CAGATACCAAATACTGTCCTTCTAGTGTAGCCG 


4451 AAM?ACX3ACTGAGATCTTTTTCCCTCTGCCi!AftAMTAT^^ 

4501 GKhGCCCCTSOkGCATCSGhC^ 

4551 TTGCAATAGXGTGraGGAAITTTTTC?CGTCTCTCACTCG6MGGACATAT 
4601 GGGAGQGCAAATCATTTAAAACATCaySSUVTGAGTATITl^TTT^^ 
4651 GGCAACATATGCCATATGCTGGCTGCCATGMCAAAQGTGGCTATAAA^ 
4701GGTCATCAGTATATCAAACAGCC^CCTGCTGTCCATTCKmTrCCATAG 
■ 4751 AAAAOJCTimCTTOAGGmGATTTTTTTTATATTTTGTTTOS^ 
4B01 TTTTTTCTTTAACATCCCTAAAATTTTCCTTACATGTTmC'rAGCCAGft 
4851 TTTTTCCTCCrCTCCXGACTACTCCCAGTCATAGCTGTCCCTCXTCTCTG 
4901 GATCC 
SEQUENCE LISTING

<110> MENARINI RICERCHE S.p.A.

<120> PHARMACEUTICAL COMPOSITION, CONTAINING FRAGMENTS OF AN
ANTIGENIC PROTEIN ENCODING DNA ENDOWED WITH ANTI-TUMOR
EFFECT

<130> 5eS3MEUR
<140>

<150> MI98A002330
<151> 1999-10-30

<160> 35

<170> PatentIn Ver. 2.1

<210> 1
<211> 213
<212> DNA
<213> human

<400> 1
atgacaggtt ctggtcatgc aagctctacc ccaggtggag aaaaggagac ttcggctacc 60
cagagaagtt cagtgcccag ctctactgag aagaatgctg tgagtatgac cagcagcgta 120
ctctccagcc acagccccgg ttcaggctcc tccaccactc agggacagga tgtcactctg 180
gccccggcca cggaaccagc ttcaggttga taa 213



<210> 2
<211> 525
<212> DNA
<213> human

<400> 2
atggtgccca gctctactga gaagaatgct gtgagtatga ccagcagcgt actctccagc 60
cacagccccg gttcaggctc ctccaccact cagggacagg atgtcactct ggccccggcc 120
acggaaccag cttcaggttc agctgccacc tggggacagg atgtcacctc ggtcccagtc 180
accaggccag ccctgggctc caccaccccg ccagcccacg atgtcacctc agccccggac 240
aacaagccag ccccgggaag tactgctcca ccagcacacg gtgttacctc ggctccggat 300
accaggccgg ccccaggtag taccgcccct cctgcccatg gtgtcacatc tgccccggac 360
aacaggcctg cattgggtag tacagcaccg ccagtacaca acgttactag tgcctcaggc 420
tctgctagcg gctcagcttc tactctggtg cacaacggca cctctgcgcg cgcgaccaca 480
accccagcga gcaagagcac tccattctca attcccagct gataa 525
<210> 3
<211> 654
<212> DNA
<213> human

<400> 3
atgggctcag cttctactct ggtgcacaac ggcacctctg ccagggctac cacaacccca 60
gccagcaaga gcactccatt ctcaattccc agccaccact ctgatactcc taccaccctt 120
gccagccata gcaccaagac tgatgccagt agcactcacc atagcacggt acctcctctc 180
acctcctcca atcacagcac ttctccccag ttgtctactg gggtctcttt ctttttcctg 240
tcttttcaca tttcaaacct ccagtttaat tcctctctgg aagatcccag caccgactac 300
taccaagagc tgcagagaga catttctgaa atgtttttgc agatttataa acaagggggt 360
tttctgggcc tctccaatat taagttcagg ccaggatctg tggtggtaca attgactctg 420
gccttccgag aaggtaccat caatgtccac gacgtggaga cacagttcaa tcagtataaa 480
acggaagcag cctctcgata taacctgacg atctcagacg tcagcgtgag tgatgtgcca 540
tttcctttct ctgcccagtc tggggctggg gtgccaggct ggggcatcgc gctgctggtg 600
ctggtctgtg ttctggttgc gctggccatt gtctatctca ttgccttgtg ataa 654


<210> 4
<211> 285
<212> DNA
<213> human

<400> 4
atgctggtgc tggtctgtgt tctggttgcg ctggccattg tctatctcat tgccttggct 60
gtctgtcagt gccgccgaaa gaactacggg cagctggaca tctttccagc ccgggatacc 120
taccatccta tgagcgagta ccccacctac cacacccatg ggcgctatgt gccccctagc 180
agtaccgatc gtagccccta tgagaaggtt tctgcaggta atggtggcag cagcctctct 240
tacacaaacc cagcagtggc agccacttct gccaacttgt gataa 285


<210> 5
<211> 1371
<212> DNA
<213> human

<400> 5
atgacaggtt ctggtcatgc aagctctacc ccaggtggag aaaaggagac ttcggctacc 60
cagagaagtt cagtgcccag ctctactgag aagaatgctg tgagtatgac cagcagcgta 120
ctctccagcc acagccccgg ttcaggctcc tccaccactc agggacagga tgtcactctg 180
gccccggcca cggaaccagc ttcaggttca gctgccacct ggggacagga tgtcacctcg 240
gtcccagtca ccaggccagc cctgggctcc accaccccgc cagcccacga tgtcacctca 300
gccccggaca acaagccagc cccgggaagt actgctccac cagcacacgg tgttacctcg 360
gctccggata ccaggccggc cccaggtagt accgcccctc ctgcccatgg tgtcacatct 420
gccccggaca acaggcctgc attgggtagt acagcaccgc cagtacacaa cgttactagt 480
gcctcaggct ctgctagcgg ctcagcttct actctggtgc acaacggcac ctctgcgcgc 540
gcgaccacaa ccccagcgag caagagcact ccattctcaa ttcccagcca ccactctgat 600
actcctacca cccttgccag ccatagcacc aagactgatg ccagtagcac tcaccatagc 660
acggtacctc ctctcacctc ctccaatcac agcacttctc cccagttgtc tactggggtc 720
tctttctttt tcctgtcttt tcacatttca aacctccagt ttaattcctc tctggaagat 780
cccagcaccg actactacca agagctgcag agagacattt ctgaaatgtt tttgcagatt 840
tataaacaag ggggttttct gggcctctcc aatattaagt tcaggccagg atctgtggtg 900
gtacaattga ctctggcctt ccgagaaggt accatcaatg tccacgacgt ggagacacag 960
ttcaatcagt ataaaacgga agcagcctct cgatataacc tgacgatctc agacgtcagc 1020
gtgagtgatg tgccatttcc tttctctgcc cagtctgggg ctggggtgcc aggctggggc 1080
atcgcgctgc tggtgctggt ctgtgttctg gttgcgctgg ccattgtcta tctcattgcc 1140
ttggctgtct gtcagtgccg ccgaaagaac tacgggcagc tggacatctt tccagcccgg 1200
gatacctacc atcctatgag cgagtacccc acctaccaca cccatgggcg ctatgtgccc 1260
cctagcagta ccgatcgtag cccctatgag aaggtttctg caggtaatgg tggcagcagc 1320
ctctcttaca caaacccagc agtggcagcc acttctgcca acttgtgata a 1371



<210> 6
<211> 369
<212> DNA
<213> human

<400> 6
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatcc 369



<210> 7
<211> 579
<212> DNA
<213> human

<400> 7
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatcca caggttctgg tcatgcaagc tctaccccag gtggagaaaa ggagacttcg 420
gctacccaga gaagttcagt gcccagctct actgagaaga atgctgtgag tatgaccagc 480
agcgtactct ccagccacag ccccggttca ggctcctcca ccactcaggg acaggatgtc 540
actctggccc cggccacgga accagcttca ggttgataa 579
<210> 8
<211> 891
<212> DNA
<213> human

<400> 8
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatccg tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc 420
tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc 480
ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc 540
ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc 600
ccggacaaca agccagcccc gggaagtact gctccaccag cacacggtgt tacctcggct 660
ccggatacca ggccggcccc aggtagtacc gcccctcctg cccatggtgt cacatctgcc 720
ccggacaaca ggcctgcatt gggtagtaca gcaccgccag tacacaacgt tactagtgcc 780
tcaggctctg ctagcggctc agcttctact ctggtgcaca acggcacctc tgcgcgcgcg 840
accacaaccc cagcgagcaa gagcactcca ttctcaattc ccagctgata a 891


<210> 9
<211> 1020
<212> DNA
<213> human

<400> 9
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatccg gctcagcttc tactctggtg cacaacggca cctctgccag ggctaccaca 420
accccagcca gcaagagcac tccattctca attcccagcc accactctga tactcctacc 480
acccttgcca gccatagcac caagactgat gccagtagca ctcaccatag cacggtacct 540
cctctcacct cctccaatca cagcacttct ccccagttgt ctactggggt ctctttcttt 600
ttcctgtctt ttcacatttc aaacctccag tttaattcct ctctggaaga tcccagcacc 660
gactactacc aagagctgca gagagacatt tctgaaatgt ttttgcagat ttataaacaa 720
gggggttttc tgggcctctc caatattaag ttcaggccag gatctgtggt ggtacaattg 780
actctggcct tccgagaagg taccatcaat gtccacgacg tggagacaca gttcaatcag 840
tataaaacgg aagcagcctc tcgatataac ctgacgatct cagacgtcag cgtgagtgat 900
gtgccatttc ctttctctgc ccagtctggg gctggggtgc caggctgggg catcgcgctg 960
ctggtgctgg tctgtgttct ggttgcgctg gccattgtct atctcattgc cttgtgataa 1020
<210> 10
<211> 651
<212> DNA
<213> human

<400> 10
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatccc tggtgctggt ctgtgttctg gttgcgctgg ccattgtcta tctcattgcc 420
ttggctgtct gtcagtgccg ccgaaagaac tacgggcagc tggacatctt tccagcccgg 480
gatacctacc atcctatgag cgagtacccc acctaccaca cccatgggcg ctatgtgccc 540
cctagcagta ccgatcgtag cccctatgag aaggtttctg caggtaatgg tggcagcagc 600
ctctcttaca caaacccagc agtggcagcc acttctgcca acttgtgata a 651


<210> 11
<211> 1737
<212> DNA
<213> human

<400> 11
atgcagatct tcgtgaagac cctgactggt aagaccatca ctctcgaagt ggagccgagt 60
gacaccattg agaatgtcaa ggcaaagatc caagacaagg aaggcatccc tcctgaccag 120
cagaggctca tctttgcagg caagcagctg gaagatggcc gcactctttc tgactacaac 180
atccagaaag agtccaccct gcacctggtg ctccgtctca gaggtgggag gcacggtagt 240
ggtgcatggc tgttgcccgt ctcgctggtg aaaagaaaaa ccaccctggc gcccaatacg 300
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 360
cgaggatcca caggttctgg tcatgcaagc tctaccccag gtggagaaaa ggagacttcg 420
gctacccaga gaagttcagt gcccagctct actgagaaga atgctgtgag tatgaccagc 480
agcgtactct ccagccacag ccccggttca ggctcctcca ccactcaggg acaggatgtc 540
actctggccc cggccacgga accagcttca ggttcagctg ccacctgggg acaggatgtc 600
acctcggtcc cagtcaccag gccagccctg ggctccacca ccccgccagc ccacgatgtc 660
acctcagccc cggacaacaa gccagccccg ggaagtactg ctccaccagc acacggtgtt 720
acctcggctc cggataccag gccggcccca ggtagtaccg cccctcctgc ccatggtgtc 780
acatctgccc cggacaacag gcctgcattg ggtagtacag caccgccagt acacaacgtt 840
actagtgcct caggctctgc tagcggctca gcttctactc tggtgcacaa cggcacctct 900
gcgcgcgcga ccacaacccc agcgagcaag agcactccat tctcaattcc cagccaccac 960
tctgatactc ctaccaccct tgccagccat agcaccaaga ctgatgccag tagcactcac 1020
catagcacgg tacctcctct cacctcctcc aatcacagca cttctcccca gttgtctact 1080
ggggtctctt tctttttcct gtcttttcac atttcaaacc tccagtttaa ttcctctctg 1140
gaagatccca gcaccgacta ctaccaagag ctgcagagag acatttctga aatgtttttg 1200
cagatttata aacaaggggg ttttctgggc ctctccaata ttaagttcag gccaggatct 1260
gtggtggtac aattgactct ggccttccga gaaggtacca tcaatgtcca cgacgtggag 1320
acacagttca atcagtataa aacggaagca gcctctcgat ataacctgac gatctcagac 1380
gtcagcgtga gtgatgtgcc atttcctttc tctgcccagt ctggggctgg ggtgccaggc 1440
tggggcatcg cgctgctggt gctggtctgt gttctggttg cgctggccat tgtctatctc 1500
attgccttgg ctgtctgtca gtgccgccga aagaactacg ggcagctgga catctttcca 1560
gcccgggata cctaccatcc tatgagcgag taccccacct accacaccca tgggcgctat 1620
gtgcccccta gcagtaccga tcgtagcccc tatgagaagg tttctgcagg taatggtggc 1680
agcagcctct cttacacaaa cccagcagtg gcagccactt ctgccaactt gtgataa 1737


<210> 12
<211> 4905
<212> DNA
<213> human

<400> 12
ccaggaagct cctctgtgtc ctcataaacc ctaacctcct ctacttgaga ggacattcca 60
atcataggct gcccatccac cctctgtgtc ctcctgttaa ttaggtcact taacaaaaag 120
gaaattgggt aggggttttt cacagaccgc tttctaaggg taattttaaa atatctggga 180
agtccctccc actgctgtgt tccagaagtg ttggtaaaca gcccacaaat gtcaacagca 240
gaaacataca agctgtcagc tttgcacaag ggcccaacac cctgctcatc aagaagcact 300
gtggttgctg tgttagtaat gtgcaaaaca ggaggcacat tttccccacc tgtgtaggtt 360
ccaaaatatc tagtgttttc atttttactt ggatcaggaa cccagcactc cactggataa 420
gcattatcct tatccaaaac agccttgtgg tcagtgttca tctgctgact gtcaactgta 480
gcattttttg gggttacagt ttgagcagga tatttggtcc tgtagtttgc taacacaccc 540
tgcagctcca aaggttcccc accaacagca aaaaaatgaa aatttgaccc ttgaatgggt 600
tctccagcac cattttcatg agttttttgt gtccctgaat gcaagtttaa catagcagtc 660
accccaataa cctcagtttt aacagtaaca gcttcccaca tcaaaatatt tccacaggtt 720
aagtcctcat ttaaattagg caaaggaatt cttgaagacg aaagggcctc gtgatacgcc 780
tatctttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 840
ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 900
cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 960
gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 1020
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 1080
tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 1140
aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg 1200
ttgacgccgg gcaagagcaa cccggtcgcc gcatacacta ttctcagaat gacttggttg 1260
agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 1320
gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 1380
gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 1440
gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 1500
cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 1560
ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 1620
cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 1680
gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 1740
cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 1800
tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 1860
aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 1920
aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 1980
gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 2040
cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 2100
ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 2160
accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 2220
tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 2280
cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 2340
gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 2400
ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 2460
cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 2520
tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 2580
ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 2640
ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 2700
ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 2760
gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatggtgca 2820
ctctcagtac aatctgctct gatgccgcat agttaagcca gtatacaatc aatattggcc 2880
attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 2940
tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa cattaccgcc 3000
atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 3060
tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 3120
gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 3180
agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 3240
acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg gtaaatggcc 3300
cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc agtacatcta 3360
cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca atgggcgtgg 3420
atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 3480
gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg ccccattgac 3540
gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc gtttagtgaa 3600
ccgtcagatc gcctggagac gccatccacg ctgtcttgac ctccatagaa gacaccggga 3660
ccgatccagc ctccgcggcc gggaacggtg cattggaacg cggattcccc gtgccaagaa 3720
agcttgtcta gaacccggga gagctcctga gaacttcagg gtgagtttgg ggacccttga 3780
ttgttctttc tttttcgcta ttgtaaaatt catgttatat ggagggggca aagttttcag 3840
ggtgttgttt agaatgggaa gatgtccctt gtatcaccat ggaccctcat gataattttg 3900
tttctttcac tttctactct gttgacaacc attgtctcct cttattttct tttcattttc 3960
tgtaactttt tcgttaaact ttagcttgca tttgtaacga atttttaaat tcacttttgt 4020
ttatttgtca gattgtaagt actttctcta atcactttct tttcaaggca atcagggtat 4080
attatattgt acttcagcac agttttagag aacaattgtt ataattaaat gataaggtag 4140
aatatttctg catataaatt ctggctggcg tggaaatatt cttattggta gaaacaacta 4200
catcctggtc atcatcctgc ctttctcttt atggttacaa tgatatacac tgtttgagat 4260
gaggataaaa tactctgagt ccaaaccggg cccctctgct aaccatgttc atgccttctt 4320
ctttttccta cagctcctgg gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa 4380
agaattcact cctcaggtgc aggctgccta tcagaaggtg gtggctggtg tggccaatgc 4440
cctggctcac aaataccact gagatctttt tccctctgcc aaaaattatg gggacatcat 4500
gaagcccctt gagcatctga cttctggcta ataaaggaaa tttattttca ttgcaatagt 4560
gtgttggaat tttttgtgtc tctcactcgg aaggacatat gggagggcaa atcatttaaa 4620
acatcagaat gagtatttgg tttagagttt ggcaacatat gccatatgct ggctgccatg 4680
aacaaaggtg gctataaaga ggtcatcagt atatgaaaca gccccctgct gtccattcct 4740
tattccatag aaaagccttg acttgaggtt agattttttt tatattttgt tttgcgttat 4800
ttttttcttt aacatcccta aaattttcct tacatgtttt actagccaga tttttcctcc 4860

WO 00/25827

tctcctgact actcccagtc atagctgtcc ctcttctctg gatcc 4905

<210> 13
<211> 31
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 13
gatcggatcc acaggttctg gtcatgcaag c 31

<210> 14
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 14
gatctctaga aagcttatca acctgaagct ggttccgtgg c 41

<210> 15
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 15
gatcggatcc gtgcccagct ctactgagaa gaatgc 36

<210> 16
<211> 49
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 16
gatctctaga aagcttatca gctgggaatt gagaatggag tgctcttgc 49



<210> 17
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 17
gatcggatcc ggctcagctt ctactctggt gcacaacggc 40

<210> 18
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 18
gatctctaga aagcttatca caaggcaatg agatagacaa tggcc 45

<210> 19
<211> 38
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 19
gatcggatcc ctggtgctgg tctgtgttct ggttgcgc 38



<210> 20
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 20
gatctctaga aagcttatca caagttggca gaagtggctg c 41

<210> 21
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 21
gatctctaga atgacaggtt ctggtcatgc aagc 34

<210> 22
<211> 39
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 22
gatctctaga atggtgccca gctctactga gaagaatgc 39

<210> 23
<211> 43
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 23
gatctctaga atgggctcag cttctactct ggtgcacaac ggc 43
<210> 24
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 24
gatctctaga atgctggtgc tggtctgtgt tctggttgcg c 41

<210> 25
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 25
ggcggtggag cccggggctg gcttgt 26

<210> 26
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 26
aacctgaagc tggttccgtg gc 22

<210> 27
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 27
gtgcccagct ctactgagaa gaatgc 26

<210> 28
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 28
gctgggaatt gagaatggag tgctcttgc 29

<210> 29
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 29
ggctcagctt ctactctggt gcacaacggc 30

<210> 30
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 30
caaggcaatg agatagacaa tggcc 25

<210> 31
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 31
ctggtgctgg tctgtgttct ggttgcg 27



<210> 32
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 32
gatctctaga atgcagatct tcgtgaagac cctgactggt 40

<210> 33
<211> 68
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 33
tcaccagcga gacgggcaac agccatgcac cactaccgtg cctcccacct ctgagacgga 60
gcaccagg 68

<210> 34
<211> 6S
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 34
cctccgtctc agaggtggga ggcacggtag tggtgcatgg ctgttgcccg tctcgctggt 60
6S

<210> 35
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: synthetic
oligonucleotide

<400> 35
gatcggatcc tcgggaaacc tgtcgtgcca gctgc 35



